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TITLE: Improved proteases and methods for producing them 
FIELD OF INVENTION 

5 induswJ" n , Umb6r °' miCr0b,a " y deriVe " ^ Pro ' eases are ^ «° P™»~ in 

Wtal am m o aod extens,ons and/or modifications of an existing C-tenrtnus. Tha invention 
further proves me resulting proteases suc „ amln0 ^ nvenlMn 

10 a Noc^o^LT"' T*" ^ '° ^ P0lyPep,it,9S hav "* «** -'='ed to 

a Nocanipps* sp. protease, and isolated nucleic aoid sequences encoding such proteases The 

ZT T*™* to nu * h add -~- — * - d - •* 

15 BACKGROUND 

n„,„ P0 ' yPeP '"'. eS hav ' n9 pro,easa «**• - Phases, are sometotes aiso designated 

nterna v W (endopepMases). EndopeptJdases show advi* XL a „d C 

" ^ ~ * * - ~- * ofl protet I 

incwjT ^ " Pr0,eaSe " h deflned hereln " a " ara) ™ a **»*— ^de bonds N 
subdasses thereof). The EC- number refe re to Enzyme Nomenclature 1992 from NC-IUBMB 

™ZT* Z dT r** ,nc,ud ' n9 ■ " p ™ ln • ~ 

z^3, 1-5, Eur. J. Biochem 199s 3^9 1 «• c... ■ »• 
B . , „ lsyi)l Z3Z - 1 " 6 . E"r- J- Biochem. 1996, 237 1-5- Eur I 

B,ochem. 1997. 250. 1-6; and Eur . Bjocnem . 199g res ^ ^ e 

nunciature ,s regulady supplemented and updated; see e.g. the Won* Wide Jeb ^ a 
http://www.chem omw. ae ..iM..hn.Ki, rgwm ^ nHAV h , ml) 1 VVVW; at 

US patent publication No. 2002/01 82672A1 discloses, that if one or two of the lest two 

oTk rr of a po,ypeptide is,are — - — ° - * •^55 

or K, R, or H <pos,t,ve.y charged), the tail would be considered polar, charged, and this makes the 
po.ypept.de res,stant against proteose degradation by a subCass of pleases that ~ e 
non-polar C-tenminal tails of secreted proteins. recognize 
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Anrther disclosure reported. M proiine rasldues a. the C-tenninus „ nascen , 
of nasoen, chams ,„duce SsrA tegglng during *an,a,,on ,em,i„a,ion. .BioLChen, 277 ^ 

5 

SUMMARY OF THE INVENTION 

It is a well-known problem in the art of expressing polypeptides having proteolytic activity 
that many of such polypeptides are inherent* unstab,e, they may be subject to LJL^t 
they may be targeted for degradation by other proteases a,ready during their produln esX 
10 « sub^ptima, yield, Many other factors may contribute to their instability, not ^ 

s^trrv - of 9reat interest to ^ de pr ° teo,ytic P ~ eS With - — 

stability, that may thus be produced in higher yields. 

The present inventors provide herein polypeptides of the S2A and/or S1E classification 

« ^tt c * teast *~ no,HMbr ° r -—^ •*» -* -* ^tt 

unclad 7 * ^ P0VPePMa - «"-*-»•» of the a. leas, three non-poiar or 

unchained ammo acd residues may he achieved by adding one or more amino acidM as a 

ZT""*' W - ^ "» Polynuc^deTas: 

« aLTT ^ "*** ^ one or more erfaing c 

tenninus of fhe polypephde of ma InvenBon resulted in much impraved yieWs as compared 
polypeptides that did no, have these Wemrinal amino add coMguraflons 

„ ... ACC °" ,in! "* a aspect «* "~"«°n relates ,o a secreted polypeptide which has 
a pha-ly* endopepBdase ac^y. which po*pep«de comprises a, teas, mree non-polarT 

(a) comprises an amino acid sequence which is a, leas, 70%. or preferably 75% 80% 

7* •* «. 93%. 04%. 05%^%. Ht£Z 

^J^l 8min0 ^ S6t|U6n0e * ma,u " ™ °< *• *own 
in SEQ ID NO: 26; SEQ ID NO: 33: SEQ ID NO: 47; or SEQ ID NO' 41 

(b) comprises an amino acid sequence which is at leas, 70%. or preferably 75% 80% 

*% Z£?-r 89% ' m% ' 91% ' 92% ' »*■ "« ^ 

2 am,n ° M<,UenCe * *" ,he mak "* * ^ polypeptide 

encoded by me pc4ynucleotide in SEQ ,D NO: 1; SEQ ID NO: 2; SEQ ID NO* 25- SEQ 
ID NO: 31: SEQ ID NO: 32; SEQ ID NO: 38; or SEQ ID NO- 40- 
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cum. med,um-h,gh. high, or »ery high stringency conditions with- 
(0 a potynucieofide encoding , the mature per. of e protease sew 

seq ,D no ' s: 35 and * tem 

Praana DSM 16548 by use of p nmere SEQ ID NO* 3 9 end 40- or fron, 
to-*** p^fca DSM 1 56 4 9 by use of pnn^ SEO „ NO's: 43^ 

SEQ ,D NO: 31; of SEQ ,D NO: 32; of SEQ ,D NO: 36; or o, SEQ .0 NO: 

(IV) a complementary strand of (I). (|,), or ( ,„ ); 
(a) comprises a mature part which is a variant «* ^ 

(e) Is en e«elicve>ientof(e), (b), (c), or(d); or 

(f) is e fragment of (a), (b). (c), (d), or (e). 

Preferably the polypeptide belongs totheS2A, ortheSIE peptidase femfes 

previous aspect PreSS,0n "* r * •* ""abuot as defined in «. 

Pofynu^reslTnedTnr^rasr 8 '° 3 *" ^ <* ^ "* — ' 
consbuc. as denned in the aX ' ' " eXP,eS "' 0n 
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JTT aspea " relates '° a ma,hod ,or i**** a p°iyp^ « 

defined ,„ me firs, aspect . the melhod composing; (a, cuifivafing a recombinan, hoe, oe« ae 
defined ,n me fourth aspect, or a hansgenic plao, or anlma, ae defined in the fifth or si«h aspecte 
fo produce a supernatant comprising .he polypepiide. and opfionafiy <b) revering me 
5 polypeptide. 

Omer aspects of then invention relate to: an animal feed additive comprising at least one 
polypepttde as defined in the first aspect; and 

(a) at least one fat-soluble vitamin, and/or 

(b) at least one water-soluble vitamin, and/or 
10 (c) at least one trace mineral; 

an animal feed composition having a crude protein content of 50 to 800 g/kg and 
compnsing at least one polypeptide as defined in the first aspect, or at least one feed additive of 
the previous aspect; 

a composition comprising at leas, one polypeptide ae defined in ,he firs, aspect, logamer 

Tc 3 5 T.t T ^ ^ am °" 9Sl «*«"• (EC 3 - 1A8 ° r 
(EC 3 2. .8); galactanese (EC 3.21.89); alpha-gaiactosklase (EC 3.2.1.22): protease (EC 3 4 - -) 

phospho pase A1 (EC 3.1.1.32); phosphoiipase A2 (EC 3.1.1.4); lysophosphoiipase (EC 3.1.1 5) 

PhosphoLpasa C ,3.1.4.3); phosphatase D ,EC 3.1.4.4* and/or bete-glucanase (EC 3.2.1.4 or 

„„,«,• 3 ^ T US "' g a ' ' eaS ' 38 defined ln •» flret *» ™P™""9 «le 

nu, bona, vaiue of an animal feed, tor inoaasing digesfibte andfor soiubie p,ote,n in a*ma, diets. 

TjZT nB , " hydrolyS ' S °' Prote ' nS in animal d,als - a ™" OT «" *• >-*nen, of 

vegetebte pn*™, me memod compdsing including me polypeptide^) hi animal feed, and/or in a 
composition for use in animal feed; 

a method for using at least one polypeptide as defined in the first aspect, compnsing 
.ncludmg the polvpeptide(s) in a detergent formulation. 

DETAILED DESCRIPTION OF THE INVENTION 

Proteases are classified on the basis of their catalytic mechanism into the following groups: 
Serine proteases (S), Cysteine proteases (C). Aspartic proteases (A). Metalloproteases (M), and 

aTZh ndr I" Un T ned ' Pr ° teases (U) ' see Handbook of eLU. 

£S Z 951 Woessner (eds)> Academic Press (1998) - in p — — 
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dans *Ld SA „ ^ ^ ^ into 6 

S evince (BanTe, 7'l^ 8 H J jT , * ^"^ "* »"""- 

.«^*.^crjzrrr: beion9 - 8,6 

(S2A). assigned to subfamily A of the S2 family 

15 in MEROPS classify as 'S1E* nm , » Handbook of Proteolytic Enzymes', are 

of Peptttase, ' ^ ^ " < ,993 > 

— m-r r^s:- - a * - - — < * - 

(a) proteases belonging to the EC 3 4 

a ' a w 106 •=** enzyme group; 

<b Serine proteases belong to the S group of the above Handbook 

toil serine proteases of peptidase family S2A- 

2 in ME^ P r teaSeS " P6PMaS6 faml " S1E M *"*- *> ^*° cn em.J. 290:205-2,8 (,993, 
* prrteasedafcbase. NucleioA^L £^ & ^ " W "» 

detern^eaoncenbeoan^oTmra^ M ~ te * Sue 

asaay-tempLu^ a ^kltel 1 7 1 ^ * ^ in qU6Sti °"- *™ - 
values are ptfT? 4 5 TH 9 « t *" Pr0teaS6 ,n qU6S,iOT - B ""*'" * a ^pH- 
* «* 60. , 60 e ! 0 90 £ V« ' ^ * — « * * 37. 
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Samples of protease substrates are casein, such as Azurine<:rossli„ked Casein (AZCL- 
ca e,„, Two protease assays are described ,„ Example 2 herein. eioter of which can be E£ 

eeee~ ■** For ,he ^ * ^ - — - - *-e.- 

' —JIT r " m " a *° nS 0 " ** ° rt9ln °' *" Pn>teaSe 01 8,8 lmen,l °" «*» «« use 
according to the .nvention. Thus, the tern, protease includes not onty natural or wTd Z 

ete hereof exh,b*ng protease activity, as well as synthetic proteases, such as shuffled 

7 ""^ Pr0teaS9S ' SU * «% Pleases can be prepaTae 

« generally known ,n the art. eg by SiteKfcected Mutagenesis, by PCR (using e PC^ Z™!! 

Mutegenes,s. The prepare**, of consensus proteins is described in eg EP 8MB tEE 

EEET 35 r herein in ™- "* a *- — - E- EEUEE 

EZ^ MUrce ' S ^ 8 ~ * - ^ -eted 

reduced" ZE * 3 —t «9ned to invoke a 

EE ™' Cal reSP ° nSe «*" » animats. inciuding man. The term 

mcreased levels of IgE ,n the exposed animal. Low^ergenic variarte may be prepared usL 

EE" IT m *" a * For ,he proteasa te * piEEE 

EE " * Pr0,eaSe ^ in a " -Ponse ConZEon 

wtth polymero may involve to vfto chemical coupling of polymer to the protease e « as ZSE 

m wo 06,17820. wo 9^0682. wo 0*35026. end/or wo garoo^g. .LEEEESE 
or atamaevay therote Involve in vivo coupiing of polymers to the protease Such EEEE 
b«**.eved by genetic engineedng of me nucleotide sequence encodktg the protease in^Z 
consensus sequences encoding additional glycosylauon sHes in the protease and exproeZ the 
protease in a host capable of glycosylating «he protease, see e g. WO 00*6354 ZEE 
~ * peering of the nucleotide seouen'ceTco ^ht 

oEE f T PWSaSe m ° n0mera an ° «-*» *>■ *• «M** of J 

oligomers. Such products and their preparation is described eg in WO 9606177 ll 

involved in an immunoiooica, response may be idemffled by various EE ^1 7, 2 Z 

display method descnbed in WO 00*6230 and WO 01/83550 or * h 

CTU "itosssg. or the random approach described 



(a) comprises an amino aoid sequence which is at teas. 70%, or prcferabK, 75% My 
... cw ,u r,u - **■ 5 =Q 'D NO: 47; or SEQ ID NO" 41 

en^Tbl ? am,n ° " SeqU6nCe * ** ^ matoe ■>« * <■» Polypep.de 
encoded by the poiynudeoade in SEQ ID NO: 1; SEQ .0 NO: 2; SEQ .D NO 25 s^o 

ID NO: 31: SEQ ID NO: 32: SEQ ,D NO 36: or SEQ ID NO- 40 ' ° 

(o) » encoded by a nuOeic acid sequence which hybridizes under' very low ^ medUln , 
low.med,um. medium-high. hi9 h, or ve^ hi 3 h ebinoency oondi 8 ons ^ 

3 "<*"■»*<«*• encodins a the matur* part of a protease said 
Precede oKainabte from „ DNA from Afccan^s a,ba DSM 

DSM 15647 by use of pnmers SEQ .D NO'S: 36 end 36: hen, 
£«. DSM 16648 by use of prime. SEQ ID NO* 39 end 40 ; or Z 
p ra s/ne DSM ,5649 by use of primers SEQ ID NOs: 43 and 

00 LT^T 6 * ' D N0 ' 1: " SEQ ID NO: * - SEQ ID NO: 25: of 
SEQ .D NO: 31: of SEQ ID NO: 32; o, SEQ ID NO: 36: or of SEQ ,D No! 

(III) e ^duence of („ or (It, o, a. leas, 500 nudeoSdes. preferably 400. 300 
200, or 1 00 nucleotides, or 

(IV) a complementary strand of (I), (||), or (III); 
(d) comprises a mature part which is a variant n f ~ * 
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I^IST 8 3 SUbS "' U,k,n ' ^ ^ — - - or 

(e) is an allelic variant of (a), (b), (c). or (d): or 

(f) is a fragment of (a), (b), (c). (d). or (e). 

5 ~J1 a.' T inVenfi ° n ' ^ " - — - 

me pro^C vTch m b6,Ween *" nUC,e0 " de * h — *- * 

soaring matri* BL^oTuseo teT ' " " "» — 

.0 used for nudJ^^^TTT f ^ *" "** ^ * 
---rnoe^Cnll^ 

for nucleotide. 9 p are " 2 for P°'ypeptides. and -4 

.5 Pea.cn <lAo, " PNAS 85 ^2448. and w. R. 

in Enzymoiogy \2^£?Z fT" ° mPanS ° n "* ^ ^ Mem °* 

J. Mo.. Bio,. IIZZ SmM, ' Watennan a,S ° rithm " T ' F ' Smlth - S. Waterman (, «, 
software (DNASTAR.no J7 f ' U * B ^^NE- MEGAL1GN™ 

p^X^Tr a? rjr wen% -* and ,he 

-pte-, ga P zz i^Tj^^r-rr para ™ tereare 

nucleotide sequences mav be d*™ h ^ 9 ° na,S " 5 " 1116 de 9 ree of between two 

P^seai^entparemetereereKtu^, gap^l ~~> * 1 °' 

po^ir^:^:* 9 -^ — - - — - ■ 

amino and/or eareo,* temflnus compared TZ ZZT, T 
30 embodiment a fragment enoodes at ieL, « 9 aC ' d 8eqUence - ,n one 

— , or a, J « J^^ZZZZT* v a ,<>0 amin ° 
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polymorphism within populations. Gene mutations can be silent a» ch a „ 

polypeptide) or may encode poiypeptides having alter^ a l! 1 ^ " * ***** 

^Pep^ 

The present invention also relates to Isolated oolvo«>iiri~ h. • 
S which are encoded by nudeic acid sequences «Z 22! ^ "**" 
mediurn, medium, medium-high, high ^J^l " ^ ° r 

. which hybridfces under the sal oolr T**"* COnd "*"' S Wi,,, 3 nUC,eic 

NO* 26 and 27; from Atocaretops* A6. DSM 15647 hi , ^ SM ' D 

10 from /Vocarefopsfe prasta rjj «T ^ T * * """"^ SEQ ID NO ' s: 35 a " a 3 * 
"ocandfopafs ZfZZ? lZ ^ *° ° M « « ««" 

pofynucfeo.de of SEQ ID N o- "^ SEQ ,D NO , ,T Q " N0 ' K 43 « 4 « <"> *• 

»•«*-«oNo r *:•;s^^i*. B ^ : • , '" Q " ,,o:a,i, *- o 

500 nucleotides, preferably 400 300 2on ,™ ' ^d""** of (I) or (II) of at least 

- ('). («). or (1,0 u Sam^ E r Fmsl an^ l M " ™ 3 *-d of 

««* 2nd ed*„„, Cold C^^'^tT TT ^ * 
^ probe !e sefccted fro* amlga, bhetJ^d'"^:^..^^: 8 ""^ 
polynucleotide oorrespondino to the m«..« sequencea of (a), (b), or (c) above. A 

25. SEQ ,D NO: 3,, SEqTno4 SEt""*" * ^ " NO: SEQ » N * 

me nude* IIZ^^, "^^ IT^^ 
NO: 32. SEQ ID NO- 36 or SEQ ID NO- „n 2 ' SEQ ,D NO: «• SK > » 

sequences of SEQ D NO 28 SEQ NO ^ K,U6 "» * - we,, aa me amino add 
41. or a fragment J£ and <^ a oil D ■ ' D T * "° ' D "* * " SE ° » N0: 
Hm genomic DMA from Nc^ZL ZZ 4^ b ° ~ 

I 27; from /»oca„*bps« Mm DSN ' ,tZ h ! ' ° f p *— SEQ ,D NO * 26 and 
■ . _ . 15 647 by use of primers SEQ ID NOV as «. 

" PSS DSM «« by use of pdmere SEQ ID NO'S ^ 9 and an- T 

fe»*P*| M ».CS M 15649 by use of primers SEQ ID NO'S 43^ « k' 
thereof, may be used to deslon a „,„*„-.. 40. or a subsequence 

polypepbdes having projTa^C a T * *~ DNA —» 

memods we., known to ma «?2.T ^ "* ■""* " ^ acco *= » 

genomic or cDNA of Z 1 ' T ' *"*" 66 — » «* - 

procedures. In order to ^11,!^ ' ntereS '' fo "° W '' n9 StendM S 0 "* 6 "- 
considerably shorter m,n Zn^ Ending gene herein. Such probes can be 

and more preferably a, ^ 35 2 ^ * ^ 15 ' ~* 31 least 25 ' 

and RNA probes can 2ZL r^T " ^ ^ ^ ran al90 te — • ■* °NA 
ba used. The probes are typically labeied fer deteaing me corresponding 
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gene (for example, with 32 p 3 h 35 cs 

present invention. ' " aV "" n >- Such probes <™ encompassed by *. 

« polypep.de having pwJTSl gL " b ~ and ** «— a 

separated by agare^ tr^SE^T? IT "~ «-» * 

other suitable carrier material In ™* . ? fransferred <° «■ immobilized on nltrooellulose or 
10 NO: 1 or a subse^^ta ' *" " *** * *- «■> « 

ft 1. ■ *. present ^ ^l^rT' 8 ^'"^'— 
Isbeled nucleio acid probe oonesoondL , ^ ' C aCW SK|Uence *> * 

complementary strand oi TTuC Z ^ ^ SM "> * 

cond^ns. Molecutes to which theTuc^c ZZ£T h 7 """""^ 
detected using X-ray ffl m . P hybn< " 2es under ««• ""Won. are 

stringencies, 35% forrnamide for M , - fomnam.de for very tow and low 

, <« ■« agency,. more ^X ^^cZ^ ^ " ^ " 

' feaSt « S ^ <medium shingency, more Prare^ l^.^'T ""* ""^ " 

even more preferably at least at 65-c , (mediunvhigh stringency), 

* higbsWngency). ^ ° * n ^»' a "<< P^-ebly a. leas, « 70-C (very 

10-O below me caiculated ^7^'^ " - * B a, „ to 

30 PH 7.6. 6 mM EDTA, 0.5% NP-40 IX D«„h„H, , ' M NaCI ' 0 09 M Tris -HCI 

sodium monobaac ^^Z^Z 0 T°V S ° d " ,m 

Soumem blading pro^. ^ of yeas, RNA per m, following srandard 

35 «*» 6X SSC a, 5-C to 10-C below me called " d **» ea * * « 
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The present invention also relate »~ • . 
5 small deletions, typically of one to aLJTn 9 of ^ P rote{ ": 

(SMamme and aspa ragl „e> hvdl^T aSPartic add >- »** "*» aelda 

sp^cac 8 . lyare ^:tn rt atarict tons t * not 9en ^ ,he 

« 1979. ,„, 7»e fttte/ns, Acade* Press N« v^ !* ~ * * ^ " d R L » 
residual activity of a corresoondino « m m . ' 1 ,east 60% - as compared to the 

The above taxonomy is according to the chanter tk . 
Ganrity & J. G. Holt in Sergey's Manual of SvTJT? * Manual b * G M " 

1. David R. Bone. Richard ™> -cond edition, volume 

It will be understood that for the afore 
•he perfect and .mperfec. states, and other CZTj^T enam ^ a ^ 

•rac equivalents, e.g., anamorphs. regardless of 
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• Mfcroorsanismen und mJZTq£ £S\. C £'? ^ ^ ""^ ™ 
and Agrtcrturai Reseat Se^ce Pa^ cT ^7""^^" Vo0r a**""**". (CBS,. 
(NRRL). Eg.. Afoca*^ toZ^ZT^TZT"" Re9 ' 0nal * Ce " fer 

DSMZ (Deutoche sa,nmiu*,3Z ^^^^^ b, ^^**«» 
10 IMRU 12SQ, NCTC 10489. *>«• *POstoy Institutions as follows: ATCC 23219. 

mentioned probes. Techniques for «»,»«„„ compote, water, etc., using the above- 

in *e art The nuole* ZZZ^ZZZZ ^ " k "°™ n 

«• oDNA library of anouw ""^ ~" " ~ <* 

has bean de.eo.ed w«h *. ^(7 '^lo „ Se< *' enCe "-*■ 3 ^ 

^-whfoHar.^^^^r 1 '^ ^ ° r ^ * 
sap/a). " ary Sk "'. ,n the art ( s ^. e.g., Sambrook ef a/., 1 98 9, 

P-sferab., abou. 60% p re ^ ^ " ^ <°* -ore 

W end even mos. pr^bZoTL ^ "* ^ »"*** *«« 90 % 

**pepudes ^2 ,K6bySDS - PAaE - 

fused po^das or 0^7^^ ^ ,nVe *" *° 
N-terminus or the C-tenninus «# «- . polypeptide is fused at the 

Produced bvfusinoa^^ A .sad poi^de „ 

a nucleio add sequence (or a nor. JIT a * ^ °° en °° din9 another 0°Vpeptide to 

^ion polvpepudl are J™ nT.^ TjZT ** ^ 

PolypepSdes so M ^ „ h ^ " *• » dl "3 -*■«-. encodin 9 the 

JZ^T* ° ,,hS *- under 

non-poiar or uncha^d; eventoTj^ " " 0 " ^ "*» «» * <-> 

more of Q, s, V. A. or P ^ ** " more •* hd an *o add(s) is one or 
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tu rr. ou tp. LP . r, ,o. op P^To7ot aTpo™ 9 ** ° WVQWP - QSAP ' QP ' 

5 even ^^j*"*"' *" *• ^W*^ of the present inverts we* produced r„ 
aspect CZtT" •' Pre 'T d emb0dimen ' to *■ <o ■» M 

xtr: vestt *** a ""^ ~ « - ~ 

P W*pt,de of the mvemon, a syne^y was achieved and a greater yield resulted 

asnec ^ ' Pt * feTOd emb0diTOnt - *• relates to the polypeptide of «he first 

wth the am.no acd sequence encoded by polynucleotides 1 - 81 of SEQ ID NO: 2. 

Nucleic Ari d Seq usnrra 

present invention also encompasses nucleic acid seouerJl . „ MM 43235 The 

the amino add sequence of amino «T£7 « 3 **** 

from me correspond^ par* of SEQ ID 2 * * " SEQ '° ™ * — » 

The present Mention ateo ■efctes to subsequen^^ !T T * ° f *" ^ 
Polypep.de fragments U hav e protease X «- which encode 
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least 225 nucleotides r^^ZT 7l 3 «*• « 

450. 500, 531, 500 VTaoT^ ^ -east 373. 

S nuclei apneas ^ l^a ^ J^T* ^ ^ inVe * n - — <° 

NO: S, 87. 33. 30. 00. 91 . 02. 03. 1. 05 i", 0^7^ ^ ^ ^ 

10 connate, thereof The "JTZ 1 " 9en ° m,C ^ •"•""■""> *»» *NA. or a 
genomic DNA ^7^7! o t u 1^ « "» ^ ~ *- 

features. See. e.g. Innis at a; 1 990 pcr- a rw^ ^ fragments with shared structural 

15 ligated activated transcription (LAtl »„rt n,^ ~T 0 Cham reactk>n < LCR >. 

be used. The nucfeic aJdXl la»t , SeqUe " Ce * ased "*—■« (NASBA, may 
rotated — and ^ T^LT* t 3 ** <* - — r or 

encoding region ofthe - JLTj^T " " ^ ^ ° f - 

a. teas, ebout 40% pure l^ZZl^ZT*' * about 20% P-W* 
about 80% pure. 1 ."T ^ "* W «« "»» I"*** a, leas, 

elecaophomsis. For J£ *° - 90% " "V aga^se 

oloning pmcedums used! aanlo Se<,UenCe * <***<« * **™<a,d 

* h ypepnae, insertion of the fragment into a vector molecule an n i„ 
the recombinant vector into a ho«st r»n »^ . molecule, and incorporation of 

sequence wil, be ^.T^LZ Z^ZT " " — * 
» ^'^.~ ongiaorarvcombina0ons ^ ™» te of genomic. cDNA, RNA. 

•em, -subs^a,: IlnT fTe p^T ^ '° ^ ^ 
- ,ts na.e source, e,.. ran ,„, ,ha, diri, , n J* J^^^-- 
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allergenic^, or the like. The variant sequence may be constructed on the basis of the nud*. «*. 

^rr as - m * - ^ - ~ ~ 

cent,, Tr «— * ■» nucteic acid sequence 

eonesp nd te fte codon usa 3 e of ft. he* organism intended for product of L protease 

For a general desenpbon of nucleotide substitution, see. e. s .. Ford a, a, 1 991 Pmte ,„ 

rzr ~ - * ^ , ~- ■* ~ - 

Ihe J^ZZT?? 8,088 Sk " l8d 0,6 ^ tf,a ' SUCh —■"■«—" be made outeide 
r ^ °' *• P0lyPeP " <i8 * *• •"*•«■ nudeic acid 

tegenests (see, e.g.. Cunntngham and Wells, 1989, Science 244: 1081-1085). In the latter 

^ZTmuCZ T Wn *" - 8 ' ^ **** »"*» - -La^n te 

~uM mute* molecules are tesled for protease activity ,o identity amino acid residues that are 

Ttet- d T *' " n,0te0U ' e - SlteS * — intemcon ca72te 
n^TT V"** 01 ,h8 » d—d by such te^C at 
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preferabty --u^Zl^^dZ mediUm SWn9Sney — "~- — 

colons with <n a po^Z^l"' T ' T^* W * ° r "* *W 
polypeptide fragment which has protease activity q ** 6nC ° deS a 

pHmers. a mutated ptasmtd eonLlTZ O 0 '*™™'- On incorporation of the 

procedu.ee Known in ^7™ 2 b T:* 0 ^*^^ * 
Nucleic Ariri Consfmrte 

the expression of the coding seouence ^sIh k . Se<,UenceS *" *** 

contro. sequels ExpresJoZZl 1- K " 1 *'° nS «"*■«• — *• 

^P^^!^^ -fttanschpttca, 

--^^~rr^^-* - — * — - 

mom naturally occurring gene or which has been modified to 
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coding sequence of the oresent in*»n««„ tk 5et > u ences required for expression of a 

S nuaeic a* sequence ZZ^LZ !l ^ 9 " " — -* « « 
boundaries of B» codlna seouL. ^ Ue " Ce * protein The 

rr determined by a ribosome wndi " 9 ste 

frame at ft. , J«££ZT? ' *"' - 0pen 

o end of the mRNA and a transcription terminator sequence located lm*t Hnuut 
of the open reading frame at the ? end of the mRNA a codl„„ ZT *^ d °*™*Mm 

> ™*d ,o. DNA, cDNA, and recorder* nocT^^es ^ *** " ' S 

n.an W ltr ra XZT^aT" B ^~^^^'-'» 
nucleic acid sequence pri^ ^7 °" * "* °> *. 

»n the expresZ vZ ZlT * ~ »"»»•**-• -«---y*p-*, 

recom^o.Anjra^riTJ™ 9 ""^ ~™ — 

necess^rtL^Trr " ^ * '"^ * - - 

con.ro, seUnctnCTne^r o TT °' * * "» * «• Each 

soon ooi s^^^r^r 77~— -»»p~. 

propeptide sequence, promoter stonaTLTd. ' ^ p0l!rade "« fo '' 

minimum, me conw sl^es 21 ^ M a 

signs,*. The com™, ZZZ ZT T^*' "* — <«nslat,ona, stop 

apace ~nr22LTL^ T "* " nkere " * ~ * '* 0 ^ 

nuceic add sequenc* en^Ta 1 ™^ *" C0 * 9 "*» * •» 

connguradon ,„ ,Z ™ ^ ""^ —* * herein as a 

coding sequeltme Df^A * """""^ *" d « 8 P osi "°" to me 

Polypi ~* *- «« — sequence directs me expression of a 

- is T^Z 7TZZT approprfate — a - «— 

sequence lalns Cp^ * 

Pdlypepdd^e promote JZZZZT " medlate "* * «" 

in the host cell of choice inchl T ^ UenC6 "^^ptional activity 

from genes 0^^ ^ , ~ « * *— 

to the host cell. CellU,arori * a '*«'"ar polypeptides either homologous or heterologous 
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of the JZ, . P " >m0,ere ** *• *>™***<x< <* •» «** constructs 

of me present .nvennon. eepeciatty in a bacteda, host =e,l. are the promoters attained fro^tet 

6 amylase gene (amyM). Badte amyfo/^fec/ens alpha-amyiase oene <ljrTT 
~s peniCinase 9 ene (per,, Ba*s ^ ^ ar^Zs.^^ 

^t^7^zt s :t 2 tj:t: (DaBoer 61 1983 - pr ™ - 

in w ociences ubA 80. 21-25). Further promoters are described in 
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of the fT* °' SUHable Prom ° tere ,0r direainS ** Option of the nucleic add consfructe 

r/rr n in a ammm »~ ho * - « «*— site 

Please, 0<yMe Mose ,, omerase ^ a3^ta 

M. or^ae mose phosphate tsonwasa,. and m «ant boated, and hyodd ^ 
c^Z'JZ <ENO naT m0,erS *" ^ ftDm - — " 

The controi sequence may aiso be a suitable transcription terminator sequence a 

preferred terminators for filamentous fungal host cells are nht^n *~ >, 
>4«toemi7/ifc T .„ 4 y rei,s are ob tamed from the genes for 

protease. "cosiaase, and Ft/sawm oxysporum trypsin-like 
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5 Preferred terminators for bacterial host cells, such as a Bacillus host ~i. 

terminators from Bacillus /w™;,™™- , u Bacillus host cell, are the 

iiwm oac///us lichenrfoimis alpha-amylase qene (amvi\ o -„ 

~rzriz ==rrr -j^zr— 

Sacc/7aAD/77yces cerevisiae alnha-fe^r , ~ H'»u*pnogiycerate kinase. 

^^LIZ^ZZTJ a po " adeny,aSon 8equence: a — - 

into the celTs secretory pathway n» 6 „h rf hTT """"^ P 0 '"*" 0 * 

may inherent,, contain a s^at jle c d " Se " UenCe 0f,te "* 

whh «» sjL o, lzz zz^: 9 jzz Iinted ,n ■ rans,a "° n read,n9 — 

» end C the coding sequence ,Z ZZT.T? * *• 

quence may conta,n a Signal peptide coding region which is foreign to .he 
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to enhance , ... ' . .. " n * u,ra ' s,9na ' «>*>9 -egton in order 

present invention. * 081 ° f choK5e m ^ be use <- in the 

Effective signal peptide coding regions for bacterial host cells are the sional n.„«H w 
reg.ons obtained from the genes for BaclHus NCIB IISsTlXTn- T ° 
stearothermophilus alpha-amylase BaciUu* ** * ™ltogen.c amylase, Sacte 

- ^se.eac^^ 

prsA Further sionai Proteases (nprT, nprS, nprM), and Bac///us sufttffe 

L are desc * ed by Simonen an ° p — i-***- 

is «*rn«w«£J ^^^TAKAarntfas^ape,^ 

- a ^: are :i a ::rzr a ~r rtase - — ^ ^ — 

fnaCve and can be conjee a ^^^^7^ " * 
of the propeptide from the propofypepMe The oro„Z - " aUt0Cata,),tiC cteava 9 e 
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chem,ca, or physical stimulus. inch.dlng the presence of e regulate* compound Regulatorv 
systems ,n prokeryote systems indude th. fee. ^ and ^ opeMm ' £ 

nigergiucoamylase promote, and As m ory.se g,ucoam*ase prZe" 
- used as regulatory sequences. Other examples of regulate* sequences are those which aZ,« 

IZZTT 1 ~ ■** - ^atered^glrht 

ope^dir^o^rr* — e ^ ** — - 

Expression Vprtnrc 

acid seruen^rr 00 " ^ * reMrob,nart «— " "uctelc 

^rrnr pesen, . ,ra,en,ion ' a •~- « and ^ 

— r ~ - ,he — — - * — ' - - - ---- 

be conItr't n !l eXpreSSl0n 3ny «■*• a <*-«« - «*"») which can 

be convene subjected to recombinant DNA praceduras and can bnng about the JnJLZ 

^te w7h T ZT* ^ **• " *• » « - compaZToMh^ 

The vector may be an autonomously replicating vecor, a vector which exists as an 

rETri*?* *• rep,ica * on ° ,whichis indepandart - - ~ n ,r 

^* an exfiachromosoma, element a minichromosome. or an artificial chromosome £ 

^ 15 ' nte9ra,ed "* "» 3— - replicated togethet 
with the chromosome(s) into which it has been integrated Rnrth™. • , ,0 9ather 

genome or tne host cell, or a transposon may be used. 
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The vectors of the present invention preferably contain one or more selects h, 
wh,ch permit easy selection of transformed cells. A selectable marke Ta a I T 
which provides for biocide nr v« ra , • . marker is a gene the product of 

5 SUD «s or Bacillus ^n^l TT ~ *" 

LYS2, MET3, TRP1 and u^3 ^Zl t '° ^ °* are HIS3 . ««• 

Include, bu are no, llLTLSTT 3 host - 

reductase). (dSZTJZLI^ ^ ^ PhTOpho,ransfera «>. *•> (nitrate 

the amdS and ovrG OPr »c «f * •,, ^referred for use in an Aspergillus cell are 

S ^ XZ. t ' - " «"~ « - *r o, 

15 celllndependen, 0,0,. genome — "*-*» of the vector h , he 

r=~— ====== *«= 

Integrated h* the £ ^„ ? ""^ ^ — * ■» «— to be 

con* a auttcj ^LTZ^T^Z^ ' — * 
1.500 base pars, and most preferably 800 to ^TJTl „T J""™ ' P ' e,erS *" y 400 to 
5 the conesponding targe. JL^ . ' WhKh are "** homologous with 

integral, ^l^Z^ZTT " Th9 
Senome of the host eel, FuZT" 2 T ^ "* *" ^ ^™ in *• 

encode nudeio eoidZJeCT.h. ^ e ' emen,S ^ * n0n - enMt '^ ° r 

odgins o, rep»ca,on are the odgins „, repiicln 0 pte Tmt ^TJTT " 
PACYC184 pemrrcng replicaeon in £ co», and pUB,,o pE ^ prlfr^ 1' I*™ 77 ' "* 
rep.ca.on in a-ta fctamptes of odgins of Ul^Ta^aThT . 
micron origin of replication ARS1 ARSa >. yMst "<*' «•» •» me 2 

n. ARS1. ARS4. the combtnatjon of ARS1 and CEN3, and the 
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combination of ARS4 and CEN6 The orinin «f 

ff» NaflonaMsa^j, of Safeacea USA 75 . 1433) ' ' ^ Eh '" Ch ' 19 ™' P ™*««»9S <* 

^uence where cells containing amplified copies of the ^i^ki^ m l 

Accordingly, the invention also relatp* * . 



Host Cells 



35 



Polypep,^. a vector co mp Hsi„ s a nude* add seance rtZ^TT - 
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extra-chromosomal vector as described earlier The term -w 

and as source. Tne ho^os. mat L Up °" «» «■» *• Polypeptide 

» or gram negate tect l £ * " S " S " BPtomyCeS 

embodiment the bacterial bo« „ ™" PMU *" no "« *>■ In a preferred 

an a^ophfc fl ^ M "*•*-* *• a-*, ce, Is 
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11111^ • g " 9 3nd Coh6n ' 1979 - Molecular Genera/ Genetics 168- 

(see. e.*. Koabter and Thome. 1937. ^,0,^^,69 5771 ^4, TKe3 T 

Press. Cambria. «T-^^ C^J^ T °? * 

171, and a. mltosporio ( Ha^„ ^ ^ * * ^ ^ 

here, «:r™r:~o:r , ho r is a yea * - - - 



10423.000-DK 



5 



10 



15 



20 



25 



30 



25 

Activities of Yeast (Skinner, FA, Passmore, S.M., and Davenport RR eds Sec Ann 
Symposium Series No. 9, 1980). ' * PA Sacte " 0 '- 

Sac n h0St ^ ^ be 3 ° andida ' Hansenuia Kiuyveromyces Picnia 

Seccharomyces, Schizosaccharomyces, or Yarrowia cell ' 

fi, am J he T 93 ' T ^ ^ be 3 fi ' amentOUS «* ^amentous fungi" include ad 

r 5 iTr r s r v,sion Eumycota and °° m ^ * *— * hJlts; 

T The h fitementoUS *"* are ^cterfced by a mycelial wall composed of chrtin 
ceHulose g,ucan, chitosan, mannan, and other comp,ex pdysacchadde, Vegetal grolist 

Z: 1 7t s T d r bon cataboiism is obK9ate,y -«* * ~ ~ e g i: 

Sa.'ST* Mucor, Nem ^ 

tr^r T ^ 56 tranSf0rmed b * a Process Involving protoplast formation 

S ^eTl ^ ProtOP ' aStS ' ^ re9enerati '° n ° f the Ce " - * ™ 

yZ e 7 r p~"°;; f " pe * s host ^ are d ™ * - 238 « « 

e ion ef al., 1984, Proceedmgs of the National Academy of Sciences USA 81- i<i7n_i^ 

totk, no et at, 1983. Journal of Bacteriology 153- 163- and Hinn«« 

^aMcade^orso,^ USA 75:7920 ' ' * 

invention ^ ' nWen,,0n relateS to 3 """""""^ h °« «" «-»Prtsmg a polynudeotide of the 
T" ^ " P0, ~<* — « <* Invention. ,„ a £Z 
embodiment, the recombinatrt host cell is a Bac/tecelL C«enwj 



Plants 



the prcsen, invemion so as to expnlTod^I ■ ^ ^ """^ "** ° f 
polypeptide mav be recovJrtT^. , P°MPeptide in recoverable quantities. The 

polypeptide may be used as such for improving the quality of a food or 
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feed, e.g., improving nutritional value, palatability, and rheological properties, or to destroy an 
antinutritive factor. y 

In a particular embodiment, the polypeptide is targeted to the endosperm storage 
vacuoles in seeds. This can be obtained by synthesizing it as a precursor with a suitable signal 
pept.de, see Horvath et al in PNAS, Feb. 15, 2000, vol. 97, no. 4. p. 1914-1919. 

The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a monocot) 
or engineered variants thereof. Examples of monocot plants are grasses, such as meadow grass 
(blue grass, Poa), forage grass such as festuca. lolium. temperate grass, such as Agrostis and 
cereals, e.g., wheat, oats, rye. barley, rice, sorghum, and maize (corn). Examples of dicot plants 
are tobacco, .egumes. such as lupins, potato, sugar beet, pea, bean and soybean, and cruciferous 
Plants (family Brassicaceae), such as cauliflower, rape seed, and the closely related mode. 
organ,sm Arabictopsis thaliana. Low-phytate plants as described e.g. in US patent no. 5 689 054 
and US patent no. 6,1 11,168 are examples of engineered plants. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers AJso 
specfic plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, peroxisomes and 
cytop.asm are considered to be a p.ant part Furthermore, any plant cell, whatever the tissue 
ongin, is considered to be a plant part. 

Also included within the scope of the present invention are the progeny of such plants 
plant parts and plant cells. 

The transgenic plant or plant cell expressing a polypeptide of the present invention may 
be constructed in accordance with methods known in the art. Briefly, the p.ant or plant cell is 
constructed by incorporating one or more expression constructs encoding a polypeptide of the 
present invention into the plant host genome and propagating the resulting modified plant or plant 
cell into a transgenic plant or plant cell. 

Conveniently, the expression construct is a nucleic acid construct which comprises a 
nucle,c acid sequence encoding a polypeptide of the present invention operably linked with 
appropriate regulatory sequences required for expression of the nucleic acid sequence in the plant 
or plant part of choice. Furthermore, the expression construct may comprise a selectable marker 
usefu. for identifying host cells into which the expression construct has been integrated and DNA 
sequences necessary for introduction of the construct into the plant in question (the latter depends 
on the DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator sequences and 
optionally signal or transit sequences are determined, for example, on the basis of when, where 
and how the polypeptide is desired to be expressed. For instance, the expression of the gene 
encoding a polypeptide of the present invention may be constitutive or inducible, or may be 
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T^ZT' T " T and * e m produa ™* ba ^ <° • 

or plan, par, such as seeds or leaves. Regulatory sequences ana. for example described by 
Tagueeta<..,98a.P/a n tPbysfoto a y86:5u6. pre, oescnbed by 

Cell 21. 285-294). Organ-spec«c promoters may be. for example, a pramoter from storage sink 

such as seed, pototo tobera. and fruits (Edwards 8 Cora*, iggo. ^ * eK 
27^303 . or from merabolic sink tissues such as meristems Oto ef a/.. ,994. P,e„ Mu a* 24: 

1 1 ^ PrOm0ter * UCh M ^ 9lUte,in ' " ro,aml "' S'obuHn. or albumin promoter 
ST^i T ^^^S* 885.89), a pramoJfrZne 
legumm B4 and the unknown seed protein gene from Wcfe raba (Conrad e( a/ 1998 Jo,,™; „r 
P,an,P» y « 708 . 711 , a pramoter from a seed o» body protein cCben e ,Tl£. * n, 

othe seed specie promote, known in the an. e g., as described in WO 91,14772. Furraem,ora 
fl^uka e, a/., 1993. Pten, P/lysfatoa) , 102: 991 . 1000 ^ 

me^transferaae gene promoters andHiggins. 1994, flW***^ 35.33 or 
■.MP gene praraoterfrom rice (Kagaya e,a,. ,995, Mofe c*ar and Genera, GeLcs 248: oZ 
874,. o a wound mducmle promoter such as me pototo pin 2 pmraoter (Xu a, a,., ,993 Ben, 
Molecular Biology 22: 573-588). 

protease^nllT"^ ^ ^ *° '° a ° h ' eVe hi9har » - •» 

pratoase ,n me pianl For mstence, me pramoter enhancer element may be an infron which is 

P aced be^, me pramoter and .h. nucleobda sequence enooding a polpepbde of 
~. For mstence, Xu a, a,. ,993. a^ra dfcetose the uae of the frra, Intran of the rice^n , 
gene to enhance expression. 

Still further, the codon usage may be optimized for the plant species in question to 
■mprove expression (see Horvath et al referred to above). 

The selectable marker gene and any other parts of the expression construct may be 
chosen from those available in the art. V 

convent^ uT'" T " int ° *• ^ ^nome according to 

e"ri7o T T" 0 ^ 0 "' ^ b - ba ^ent, biolistic transformation, and 
electroporatron (Gasser ef at., 1990, Sc/ence 244: 1293; Potrykus, 1990, BioTTechnCogy 8- 535- 
Shimamotoe/a/.. 1989. Nature 338: 274). . o or, ecnnoiogy s. 535, 

aen^ Pr ^ mV ' A9 Z baCteriUm ^ efec --^diated gene transfer is the method of choice for 
generating transgenic dicots (for a review see Hotwka* a „w * u-, 

« review, see Mooykas and Schilperoort, 1992, Plant Molecular 
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Bjoj 1ft ,5^6). However H c*, a,s, be use, for uansfotming monocote. aHhough other 
franslbrmabon method, ere generally preferred for mesa plan*. Presen»y. ,he nJodTcC 
fcr generabng monocofs is particte bowmen, <ml*oecopic gold o ItL" 

pedro.es ^coated wfth ft. trending DNA) o, embryonic calli or developing eLos (Ch^C 

Vasil ef el, 1992. Btortechnafegy 10: 667-674). An akernabve method for tZformeZ ^ 
monocote t based on proves, uansfcrmafioo descrjbe(| o^JT*^ * 
MofecularBiotogy21: 415-428. ' 
Following transforation, the transforments having incorporated therein the exoression 

The present Invention also relates to methods for producing a polvpeptide of the present 
invent compnsing (a, cu,,iva«„g a transgertc plan, or a plan, ce» conTpnlg a 
sequence encoding e polypeptide having pmtease acuvity of the presen, nLC IX 
condone conducive fbr preduCon of me polypepbde; and » recoverLg the poTep 1 le 

ZT- 3 ' ranSBen " ! ^ " ^ C " 9 3 P^^rTe^n 

cte,m 8. or en expression vector or polynucleotide construct of me invention. 

Animals 

element ^ T* **° 3 ' ranS3enlc ' non - hu ™*< »*"«' »d products or 

elemente thereof, examples of which are body «uids such as milk and blood omens flesTand 

ensiiy Kress (1999). and the three other handbooks in this series relatina to G«n» 
Trenscnptlon. RNA processing, and Posl-fransMona, Processing. OeoereUyZ^t pre^e 
a 1ransgen,c an™,, selected ceUs o, a selected amma, are fransfomieo 1 a nulTS 
sequence encoding a polypeptide having protease activity o, me present InvenJnTast 
express and produce the po*pephde. The polypeptide may be recovered from me anima, e 1 
from the mux of female animals, or the polypepbde may be expressed ,o me belf. of Za„i™, 
itself, e.g. to assist the animal's digestion Examoles of *>„i m , 

section headed Animal Feed. ' *" below ,n «» 

anima, 7 ^ * ****** "*""' a vlK " to -*»vertng the protease from me mi,K of the 

quesuon. e.g. by use of a transgene expression vector which comprises a suitable milk orotem 
promote, and the gene enco*g m e please. The trensgene express vecj^l^d 
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rto MM eggs, and prefer pennanenUy integrated into the chromosome Once the eca 
to crow and divide, me potent emb^o is implanted into a surrogate mother, and alats 
canytng the transgene are identifled. The resuKing enima, can then be muitipiied by conveZnt 

S P ° lyPeP,ide ^ 66 ^ *" *• - •* Meal. H iTTa 

5 («• Brpresston of recombinant plains in the mlik of transgenic animats, Gene Zstm 

~ £™ tbr the ad o, express*, , M. Pemanda, and , , HoeJ <H 

In the alternative, in order to produce a transgenic non-human animal mat cantes in the 
genome of d, somatic and/or gem, calls a nuCeic acid seguence Coding a hZk^ 

The invention relates to a transgenic, non-human animal, or products or elements thereof 
m compnstng . pCynuc^e, or an evasion vector or po^eoje consul dt^T' 

Methods of Production 

The present invention also relates to methods for producing a polypeptide of the ore^nt 
mvendon compdsing (a, coding a host cei, or a trensgenic pll J anZ^^Za 

30 " 3 S, ~* - -»* « — ^ 

In the production methods of the present invention, the cells are cultivated in a nutnant 
sudaHe for product of fhe polypepdde using memods known in the art Fo r aCT 
the cell may be cultivated by shake flask cultivation, small-scale or laree-scale fenZ^ 

25 fermemore penned in , suitable medium and under con^ons alcving the pelade to t* 
med,a are avertable from commercial suppliers or may be prepared according to published 
the polypepttde ,s not secreted. It can be recovered from cell lysates 

^ ' n0 " ,de "* °' ^ — — - a 
product or disappearance of a substrate Pnr OV3 m., 

• ^neflteacdvdyofthepo, ' """»—* 
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the no, ^ ""*« P ° l>Pe,, " ,le «» 69 •» methods known in the art. For example 

c*n,r«ugauon. .Kration. exbsCon. spray^. or preb,p«a tte „ 

Tl» polypepSdes of the present invention may be purified by a variety of procedures 

Mnophoo*. chrom*ofocus,ng, and size exdusbn,, efccnapho^c prunes (e.g., prepare 
tr,^* differenW SO " ,bl " ,y amm0n,Um SU "« a SOCAGE or 

p ' ote * , J - c - JanMn ana ure vc » *— 

10 

Compositions 

mJL'J!! flJ " her , aSPeCt ** ■»«*» to composes compnsing a 

poiypephdeofB.apresen.rnvention.The polypeptide compositions may be prepared in 

The !? ^ T.lr** COmPOSW ° n "* ' he ^ * a - a mlcmerLa* 

Known » me art. Examples are given below of preferred uses of the polypepbdes or polypeptide 
compositions of the invention. pu'ypepnae 

20 Animai Feed 

The present invention is also directed to methods for using the polypeptides of the 
rnvenbon ,„ animal feed, aa wen as ,o feed composMons and (JaddLW^ L 

25 LZ^ h T ™ ~™ nte ' a "° —» - cows, sheep and horees. 

~ r "T^""* an ' ma ' fe 3 n0n ■ ,U,ninan, animals inoiude mono- 

geafiic an,ma,s, e.g. pigs or swine (including, but not limited to. pigiets. growing pigs, and sows)- 
poulby such as tunreys. ducks and chicken (including but no, limted to broiler" *s 

£Zfr; but no1 wed *° ** -* -* 

crustaceans (.ncludmg but not limited to shrimps and prawns) 
30 The term feed or feed composition means any compound, preparation, mixture or 

compose suitable for. or intended for intake by an anima! 

. th T e aCC °^ in9 10 *• inV6ntl0n ^ Pr ° teaSe be fed to the ani ^ before after or 
simultaneously with the diet. The latter is preferred. ' 

In a particular embodiment, the protease, in the form in which it is added to the feed or 
when be,ng .ncluded in a feed addflve, is well-detined. Well-detined means that the p^ 
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preparation is at least 50% pure as determined by Size-exclusion chromatography (see Example 

ZT 5) - ' n 0thef ParfiCU ' ar 6mb0tf ^ ^ ~ P-ParJon Vat .east^ 
80. 85, 88, 90, 92. 94. or at least 95% pure as determined by this method. A well-defined protease 
preparation is advantageous. For instance, it is much easier to dose correctly to the feed a 
protease that ,s essentially free from interfering or contaminating other proteases. The term dose 
correctiy refers in particular to the objective of obtaining consistent and constant results, and the 
capability of optimising dosage based upon the desired effect 

in , H *V* ^ j " anima ' feed> Pr ° teaSe need not be that * ™* e.g.. 

includeother enzymes, in which case it could be termed a protease preparation. The protease 

preparation can be (a) added directiy to the feed (or used directly in a treatment process of 
vegetable protems), or (b , it can be used in the production of one or more intermediate 
commons such as feed additives or premixes that is subsequently added to the feed (or used 
■n . treatment process). The degree of purity described above refers to the purity of the original 
protease preparation, whether used according to (a) or (b) above 

Protease preparations with purities of this order of magnitude are in particular obtainable 

ZLT" bma ? t h m t° dS ° f Pr0dUCt, '° n ' they are not - -* obtained and also 

subject to a much h,gher batch-tc-batch variation when the protease is produced by traditional 
fermentation methods. Such protease preparation may of course be mixed with other enzymes 

,n a embodiment, the protease for use according to the invention is capable of 

so.ubihs.ng , vegetable proteins. A suttaWe assay for determining so.ubi.ised protein is disdosed in 
example 11. 

The tarn, vegetable proteins as used herein refers to any compound, composition 

ZZT • - 7"" ** " * ' eaSl ~ '~* dBrtTOd - a 
vegeteble. ,ndud,„g modified prefehs and proteirvderivafives. In particular embedments. ,ha 

Protem content of the vegetable proteins is a, leas, 10. 20. 30. 40. 50, or 60% <w/w>. Verbis 

CI maV , be , d t ed ^ Ve9eteWe ~ SUCh SS - — * 

examptematenals from plants of the famfes Fabaceae (cegam/nosae). Cmdfereceae. 

a^opo<*aceae. and Poaceae, such as soy bean meal, lupin mea, and repeseed mea.. In a 
particular ^embod, m ent the vegeteble protein souree Is materia, from one or more plants of the 
family Fabaceae. e.g. soybean, lupine, pea, or bean. 

in another particular embodiment, the vegetable protein source is materia, from one or 
more piante of the family Cbenopooreceae. eg. bee,, sugar beet, spinach or quince. Other 
examples of «seteble protein sources are repeseed. and cabbage. Soybean is a prefemed 
vegetable pretem source. Other examples of vegetable protein sources are cereals such as 
barley, wheat, rye, oat. maize (com), rice, and sorghum. 
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TheTO ^ la <^'n9«oth ei n»eml 0 n 0 fve9etabl e p ro te i „ s> ^ atteastonepr( ^ aseof 
the .nvention resulte in an increased solubilisation of vegetable proteins. The Mowing are 
exarnp.es of % solubillsed protein obtainable using the proteases of the invention in a monogas.no 
rn v*ro mode,: At teas. ,02%. ,03%, ,04%, ,05%. ,06%. or a, teas, ,07%. re,a,lve te 
The percentage of soiubifeed protein is determined using ate monogastdc /„ vHno model o, 

STstb ; T *" K°" ,b ' MOn * ^ b ™ ICa " y >™ te, "<*> «• ■**»■ 

Such •*•*-*» -ay be due ,o protease-madiatad retease of protein from other component o, 

the usually complex natural compositions such as teed. 

oaoaJV- emlMdlmen, • "» - *t use acooreing to the invention is 

^TT" am ° Un ' * proteins. The foltowing are ex*nptes of 

L If! " " 0b,ainable US!n9 0,6 Pr ° teaSeS * »• »»•"«» * * monogas.no 

^ mode,: At teas. 104%. 105%. ,06%. ,07%. ,08%. ,00%. or a, teas. ,10%. retebve to a 

^mpte^ " * 3 ° SU " " di9eS " bte Pr0tein fe •» * «*» "»*H of 

The following are examples of % digested or digestible protein obtainable using the 
protease of me invention in an aquaculture in «Uo model.- At leas, 103% 104% 105% Ly 
107%. 108%. 100% or a. leas. ,10%. retebveto a blanK. The percentege o, digged o ^bt 
protein is determined using the aquaculture in vitro model of Example 1 2 

*! Part ' KUlar «*•*«*. protease for use according to ma Invention „ 

capable of increasing the Degree of Hydrolysis (DH) o, vegetabte proteins. The following are 

ZP£ T; ITT 5 taease 0b,alnaM8 ln a mo — - — « ~ 

uang the monogaseic „ wto mode, of Example 11. The teliowing are examples of Degree of 
H£ro,ysis increase obteinable in an aquacuKure te woo mode,: At teas, ,02%. ,03% ,04% 

rmor;L:r,r'^ etoabiank " aDHisda, ~^- a ~'« 

'"aP^'^bodirmm^ 
quesbon ,s aKecbng <c acbng on. or exerting ite sCubillsing France on) ,he vegeLie pro J s 
or protein sources. To achteve thls. the vegetobie protein or protein souroe is typicaHy suspended 
»•«*»* e* an aqueous solvent such as water, and the pH and temperature values are 
ad^ed paying due regard to Ihe charactenaBcs of me enzyme in quesbon. For exampte. me 
*eate,en. may take ptece a. e P H-value at which the ac»v*y of me actual protease is a. leas, a, 
leas, 40%. 50%. 60%. 70%. 60% or a. leas, 00%. Likewise, tor exampte. the beamren, 
place at a temperatore at which ,he achv«y of me actual protease is at teas, 40%. S0% 60% 
70%. 80% or a, teas, 90%. The above percentege artvlty indicabons are relabve to me maxim 
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actenhes. The enzymatic reaction is continued untit the deaired rasui, is acttieved. foiling which 
t may or may no, he stopped by inaafcatog me enzyme. e. g . by e hea M reato,ent step 

in another particular embodiment of a beatoten, process of the invention, the protease 
aeon ,s sustemed. meaning e.g. that me prefer is added to the vegetable proteins or £Z 
aouroaa. but „ sousing lnfluence „ „ „ ^ m ^ ° 

suable sousing condHtons ere established, or once any enzyme infers are Z 
whatever other means could ha»a been appted to postpone the action of the enzyme 

In one embodiment the treated la a pnMreetment of animai feed or vegetable protaina 
for use in animal feed. =a<=i«»e proieins 

and/or * Tr!?^ *" mM " m! " * " means '""^"8 the availably 

T, Pr0teinS ' *"* ISadh9 '° — — •» «ne dW 

~mponen* htgher protein yields, increased protein degraded end/or improved protein 

s^ aa grow* rate and/or wetgh. gain and/or tead conversion ralio (i.e. me weight of ingested 
feedralativetowaightgainjoftheanimalls/areimproved 

Jl ° r 10% ' a ambodimant the weight gain is 

ZZ. f T ^ 3% ' ^ 5% ' 6% ' ?% ' 8% ' 9% ' 10% " * — Tnea. «Z are 
relative to control experiments with no protease addition 

EEC (1988). Direct™* da la Commission du 9 aval 1988 fixer* la rmMhode da calcul da la vaieur 

zszzzzr- — ■ * * — «— - *~ - 

The protease can to added to the feed m any form, be it as a relatively pure protease or in 
c*er ccraponents intended tor add»on to arfma, teed. i.e. te'the L Z 
feed additives, such as the so-called prefixes for animal feed 

such as'aiLT'/T ** ^ kWen "° n ^ '° *r «" in anima, tead, 

such as animal feed, and animal feed additives. e.g. premixes 

a. teast'cTfeZ,,!!' r TT"* m ' nVenSOn ' lhS «« addles of me tnventton contein 

ZZ^lTm ; M "" ,te ™«*^«. and/or a, teas, one trace 

mineral. The feed additive may also contain at least one macro mineral 

stebiiJ^ ° P T"' feed - addi8re « colouring agems, aroma compounds 

gateZateTc^lT 951 , ^ 313 8 " 3 , '" 6,: *"»" <« 

gaiactenase (EC 3.2.1.89): aipha^aiactosidaae (EC 3.2. ,.22); protease (EC 34- 
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Phosphor ai (EC 3:1.1.32); phosphoHpase A2 (EC 3.1.1.4): lysophosphollpase (EC 311 sy 
^7 C <3 - , 4 - 3,: ° » 3.1.4.4): **/or bet^lucan/se (EC £Ut 

5 proie,^! 0 ::, embo< " men ' *~ o,h9r ^ are * — - 

1 Than^nT'? °'.T ier ° bia ' P " P8CbS (AMP ' S) are CAP18 ' '""»* ^ ™rptiei„, Protegnn- 
££T , ^ LaC " ,fe,riCln - °^ 8Plrin 5UCh as No ^" <W« Uhrer 

■ r^r* indudina 8,6 Mmp ~ nds - *— * 

10 ZZZZ? T/D ~ 1 * as - M variante « - - — « — . 

nta, nfT P ' eS anWUn0a ' P ° ll ' PeptW6S <*«*•> « •» A****. glgantBus . and ^ pe ^ te 

™B»r pepbdes. as well as variant and fragments .hereof which re* arrftagal J2 2 
disclosed in WO 94/01459 and WO 02/090384. 

Usally fa,, and water-soluble viramins, as we» as trace minerals forra pan of a soiled 

r: ,or addK,on ,o 0,6 ,ea * wheraas macro mina * - —* ~ ^ 

indudJ,"/™ ParaCU ' a ' en,bodlmen '' * e ■** »- *«ve of In. Invemion is intended for being 

'"^<OfPr=^bedashaving to beinduded)inanima l die b or,eeda tl a V e lS of0 01 (o100% 
20 more particularly 0.05 to 5 0%- or n 9 tn 1 no/ ,0/ - * 

pedicular for preraixes ' % mean "' 9 Th* is so in 

The following are nonexclusive lists of examples of these components- 
Viramin *"* K ° 3 ' *"* E ' - *• e.g. 

' B2 *S nlatnt" 80 '? I?"*" ™ VHam '' n ^ *** *"* »«• —*> 

B2. vitamin B8. niacn, folic ao,d and panthothenate, e.g. Ca-D-panfhothenate 

^ Examples of .race minerals are manganese, .inc. iron, copper. Iodine, selenium, and 
Examp.es of macro minerals are calcium, phosphorus and sodium 

«*. zT", nUb ! toni " reqUiramen,s <* *— components (exempflfied „«h poultry and 
piglets/pigs are listed in TableA of WO 01/5S97* mmm.- . pourayano 

. . , en or wu Q1/5827S. Nutritional requrement means that these 

components should be provided in the diet in the concentrations indicated 

hduJ! a " ema " Va ' ^ a "" nal ^ a *"" Ve °' 9,8 inve *" «-P"ses a. leas, one of a. 

: CTT^r TaHa A * W ° ° 1 ' 58275 - " tea * « — • — * « - 

more of. one, or two, or three, or four and so forth up to al, .hldeen, or up to „ Mean jndlvidua , 
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componente. More specially, mis ,„ dMdual „ inc|uded 

ta "nvenbon ,n such an amount as to provide an in^e^centnaion within tha renge indicated 

tncolumnfour.orcolumnfivB,orcolumnsb<ofTableA. 'novated 

or di«sT T I"" 8 "" 0 " 3180 Wlate8 10 animal feSd ""X***™- Anlma. faad compositions 
or d,eto hava a relate hlgh canton, of protein . Poultry and plg ^ ^ ^ ~ 

indicated „ Tabto B o, WO 01/58275, catumns 2-3. Fish diato can ba charaotansad as 

9*9. WO 01/58275 oonasponds to US 09/779334 which is haraby incorporated by reterance 

800 o* a TdH feed C ° mP0SlUOn aCC0K " n9 * ^ ' nVen "° n haS a "* •»«* ««« - *>- 
800 9*9. and furthermore oomprtsea at leas, one protease aa claimed herein 

Furthermore, or in the aKemative (to the crude protein content indicated above) tha animal 
feed compose of me invention haa a content of metabolisable energy o, 10-30 
content of cafcium of 0,-200 and/or a content of available phosphorua oTfl 20o"L 

0.1-1 50 g/kg; and/or a content of lysine of 0.5-50 g/kg. 

in particular embodiments, the content of metabolisable energy, crude protein calcium 
Phosphorus methionine, methionine plus cysteine, and/or lysine is wZ any on o 1^7 3 
4 or 5 in Table B of WO 01/58275 <R. 2-5). 

WW-fTlTTJ. CateU ' ated M n ' ,,09e^, (N> mU " iP ' ied * 3 **» 6 25 ' "» «** P>«ein 
ZnZ DC, ^ ^ * °' °~ ^ 

»r,im„i „ . ••• .. .. ""w earaon 1988, subcommittee on swine nutrition, committee on 
animal utnton, board of agriculture, nataa mmmth ^ Nationa| ^ ^ 

s ~ • t r ** and 0,6 Eurapean Tabb * v *- * ^ 

ferafisch bedmf Ponson * loogen bv. Wageningen. ISBN 90-71463-12-5 

diato iaT, Tl T °' CalC ' Um ' aVa " able Ph ° Sph0,US and ■*» * anima, 
die* s caiculeted on me baaia of feed tebles auch as Veevoedertabel 1997, gegevens over 

rr^b^T'r ~"*- - - voedenJLten. CeZ 

Veevoedarbureau. Runderweg 6. 8219 pk Lalysted. ISBN 90-72839-13-7 

one veol^rT a " imal «** ^PosHion of the invent contains a, leas, 

one vegetoble protein or protein source as defined above. 
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in Ml further particular embodiments, the animal feed composition of the Invention 
cor^ns 0*0* ^ and/or 0-60% alburn; andfor 0-70% wheat; air 0-70% Z^ZZ 
0-30% oats: and/or CM0% soybean meat and/or 0-10% fish meal; andfor 0-20% whey 

Ammal d,ets can e. 9 . be manufactured as mash feed (non pelleted) or pelleted feed 

mnerals am added aecordtng to the specifications for the spades in question. Enzymes can be 
a ded „s* o, ^u,d enzyme formulations. For exempt, a solid enzyme JZZZZ 

palling step. The enzyme may also be Incotporated in a feed additive or prambt 

The final enzyme concentration In the diet is within the range of 0 01-200 mo am™. 

be,ng in mg protease enzyme protein per kg feed (ppm). 

For determining mg enzyme protein per kg feed, the protease is purified from the feed 
~rnpos*on, and the specific acfivity of the purified protease is defended using a r^am asTay 
(see under protease acfivity. substrates, and assays). The protease acttityT^ 
20 compose as such is also determined using the same assay, and on the bH oUuL Z, 
determines, the dosage in mg enzyme protein per kg feed is caiculated 

const 7T T"^ ^ fU,ther * «" f ° ,,OWin 9 exam <> les *** ^ould not be 

construed as limiting the scope of the invention. 

Deteroent Com positions 
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The protaaaa of the invention may be added to and thus become a component of a 
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more other en.ymes such as another protease, such as afcaline proteases to bJ2T! ZT 
etc.). and the enzyme^) should be present in effective amounts «nored,ents, 

from P. afca^nes or P. pseooV^,^ 218^2, T aRseI "* >m0 " as 

(GB 1 372 n™ o „ "^"aenes (EP 218272), p. cepaoa (EP 331376), p. sfufeetf 

It^T ^ """*'"»'- * ** SD 70S (WO 95)06720 and WO 

Suitable cellulases include tho*:^ k^ M - . * . 

pmteln endeared mutants are Z£~L 7 . ^ ° h6mfca " y ^ " 

Sac**. Pseodfcmonas. HmJfTLZT , e **~ ** ** "~» 

produced ta, „ umiOT/a ~ ^'"1 6 * ** ^ tMmm 

disclosed m US 4435 307 usTL and Pusadum o^x^n 

^ 11,8 alkal,ne °T cellulases having colour care benefits. 
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WO 94*4 8 ^ o~5 T S ~ ^ ^ M *— *> 

98/12307 and « c I 5 686 ' 593 ' US 5 ' 763 ' 254 - W ° WO 

Perexidases L SZTT? &amP ' eS 01 ^ P8ro>ddaSeS 

Guerefoyrne™ (Novozymea). Comment* avallable include 

addi^tnST o r' me<S> may te ,nC " ,dad 8 deto9ent «"*•-» •» — • aaP-ata 

*- ^z^zzzrz^: Mmwn i a *^ — - 

>s additive, can be formulated e „ . separate ■**«» or a oombinad 

Canons JZSZ t Z£T*J ° *** * 

or manias ^ ™ KlUS " n9 *"*<* ■«**' atebtead 

4 9ra " UlateS ^ b9 e ' 9 - " *— - US 4,06 991 and 

4,661,452 and may opt onally be coated hvm^fh^ u . ,,UD ' yyi ancf 

to 80 eeiylene oxide ante- hay alltTjT ! a,0mS ^ *"•■•« 

EP 2382^. ™» 66 Prapared a ^ di "9 to the mearod disclosed in 

-* art~i~ b ; : r — ^ - a - ■ 

^, ypre sema.e,eve,of fr ornO., %to6 o %Dyw Xnr n ° n ' C ' ^ ~ 
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When Included therein the detergent will usually contain from about 1% to about 40% of an 
a Z S ~ SUCh " " near ^-nesulfonate. a, P ha.,efinsu,fonate. £t21 tZ 

5 . n« When ^ th8rein d6te ^ ent Wi " USUa "y from about 0.2% to about 40% of 

febHc J^rT* ^ *° °° n,a,n ""^ *•■■«« ingredients such as e a 

fabnc conditioners includinq clavs foam hno^r.. ^ sucn as e -9- 

suspend , gente , sr^e^TtL ■* 
enzyree pre,* pe, lter <* ^ ^ , 0 retr^ ' * " ^ * 
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amh0 dW .T^ deSCribe<, ^ *"* fe M to «» ■««• *> =coPe by the specific 

di!dosed ' slnce 11,656 -*«*— are «— - d 

«"* «— "edifications of the invasion m ^ to ^ £ ^ 

5 here,n w m become apparent to those sk»,ed ,n the ar, from ^ foreaolng de 
n^cafions are siso Intended to M wfibin ^ scope of ^ 
conftct. the present disclosure including definitions will control " the case of 

in fire, SET"""-" ^ ^ °' " - ^rence 

) 

EXAMPLES 
Materials and methods 

Strains: 

a^rt , P T r denChS6n ' B 61 6I - ,99 °- Cton '"» " aldB . — -codes 
deCarb0!,yl3S6 ' " 6XMnZym9 *°™ *— ««-* J- Bacterio,. 

Bacillus subtilis MB1053 
Bacillus subfilis PL3598-37 
Bacillus subtilis MB1510 

BscmvsuWs PL2306. This strain is fire B.sums DN1885 with disrupted apr and „„r 

ZXTTt- B - Wedsted - u - H6d696ar * L - ^ »■ *■ Z 

Oonmg of atdB. wh,ch encodes a-pha-acetoiacfcte decarboxylase, an exoenzyme front 
•"• Bactedo... 172, which is also dlsrup.2 in Z 

transcnpfiona, uni, o, the Known Bac^ aub«a cefiuiase gene, resulting in cell 
negate ce»s. The disruption was penned essenfially as described to (E* Tl 

zr^:r and R,cha,d Loack (,993) ea * a su °* - 

Positive Bactena, American Society for microbiology, p.618). 
Procedure for isniaring oen omi^ DNA 

AdTsoo ul^' C T r feSUSPend 1 00 M ' TEL ^ 31 37C * 3 ° «*• 
Add 500 pi thiocynate buffer and leave at room temperature for 10 min 

Add 250 pi NH4Ac and (eave at ice for 10 min. 

Add 500 pi CIA and mix. 

Transfer to a microcentrifuge and spin for 1 0 min. at full speed. 
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roro!gLr ematant t0 3 ^ EPPend ° rf tUl>e ^ ^ 0 V °' Ume iS °^ anoi - Mi * 
Spin and wash the DNA pellet with 70 % EtOH. 
Resuspend the genomic DNA in 1 00 pi TER. 

TE: 10mMTris-HCI, pH7.4 

1 mM EDTA, pH 8.0 
TEL: 50 ™g/ml Lysozym in TE-buffer 

Thiocyanate: 5M guanidium thiocyanate 

100 mM EDTA 

0.6 % w/v N-laurylsarcosine. sodium salt. 
60 g thiocyanate, 20 ml 0.5 M EDTA, pH 8.0, 20 ml H20 
dissolves at 65C. Cool down to RT and add 0:6 g N- 
laurylsarcosine. Add H20 to 1 00 ml and filter it through a 0 2 u 
sterile filter. 

NH4Ac: 7.5 M CH3COONH4 



TER: 
CIA: 



1 ug/ml Rnase A in TE-buffer 
Chloroform/isoamyl alcohol 24:1 



Purification of pp.p h ands anri dm a sequencing 

PCR fragment can be purified using GFX™ PGR DNA and Gel Band™ Purification Kit 
Pharmaaa Biotech) according to the manufacturer's instruction, The nucleotide ^ es o 
10 the amplrfed PCR fragments are determined on an AB. PRISM™ 3700 DNA AnaZTJn 
E mer, USA) using 50-100 ng as template, the Ta q deoxy-termina, cycle selling k Pel 
amer, USA), fluorescent labeled terminators and 5 pmo, of the sequencing pier of choi J 

Wedia 

^ LB-PG agar is LB agar supplemented with 0.5% Gfccose ana 0.05 M potassium phosphate, pH 
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S2A protease activity is measured using the PNA assay with succinyl-alanine-alanine- 
prohne-phenylalnine-paranitroanifide as a substrate unless otherwise mention. The principle of the 
PNA assay is described in Rothgeb. T.M., Goodlander, B.D.. Garrison. P.H., and Smith. LA., 
Journal of the American Oil Chemists' Society, Vol. 65 (5) pp. 806-810 (1988). 

Gene exp ression in B acillus subWg hn<a 

All the expressed genes in the following examples are integrated by homologous 
recombination on the Bacillus subtilis host cell genome. The genes are expressed under the 
control of a triple promoter system (as described in WO 99/43835), consisting of the promoters 
from Bacillus licheniformis alpha-amylase gene (amyL), Bacillus amyloliquefaciens alpha-amylase 
gene (amyQ), and the Bacillus thuringiensis crylllA promoter including stabilizing sequence The 
gene coding for Chloramphenicol acetyl-transferase was used as maker. (Described in eg 
D,denchsen,B. : Poulsen.G.B.; Joergensen.S.T, A useful cloning vector for Bacillus subtilis 
Plasmid 30:312 (1993)). 



Example 1 

A synthetic 10R gene (10RS) encoding a S2A protease denoted 10R from Nocardiopsis 
sp. NRRL 18262 (WO 01/58276) was constructed which has the nucleotide sequence shown in 
SEQ ID NO: 1. This synthetic gene was fused by PCR in frame to the DNA coding for the signal 
pept.de from SAVINASE™ (Novorymes) resulting In the coding sequence Sav-10RS which is 
shown ,n SEQ ID NO: 2. Several tail-variants of this construct were made. Compared to the Sav- 
10RS protease encoded by SEQ ID NO:2 the tail variant construct Sav-10RS HV0 was 
constructed to have 8 amino acids extra in the CMerminus: QSHVQSAP (SEQ ID NO* 3) which 
were encoded by the following DNA sequence extension inserted in front of the TAA stopcodon of 
25 SEQ ID NO: 2: 

(SEQ ID NO: 4): caatcgcatgttcaatccgctcca 

Tail variant Sav-1 0RS HV1 was constructed to have 4 amino acids extra in the C- 
terminus: QSAP (SEQ ID NO: 5). with the following DNA sequence extension inserted in front of 
30 the TAA stopcodon: 

(SEQ ID NO: 6): caatcggctect 

Tail variant Sav-1 0RS HV3 was constructed to have 2 amino acids extra in the C- 
term,nus: QP (SEQ ID NO: 7) with the following DNA sequence extension inserted in front of the 
35 TAA stopcodon: 



(SEQ ID NO: 8): caacca 
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Tail variant Sav-10RS HV2 was constructed to have one amino acid extra in the C- 
terminus: P (SEQ ID NO: 9) with the following DNA sequence extension inserted in front of the 
TAA stopcodon: 

(SEQ ID NO: 10): cca 

The 10RS gene and the four tail-variant encoding genes were integrated by homologous 
recombination into the Bacillus subtilis MB1053 host cell genome. Chloramphenicol resistant 
transformants were checked for protease activity on 1% skim milk LB^PG agar plates 
(supplemented with 6 ug/ml chloramphenicol). Some protease positive colonies were further 
analyzed by DNA sequencing of the insert to ensure the correct gene DNA sequence, and five 
strains, each comprising one of the above constructs, were selected and denoted, respectively: 
B.subtilis Sav-10RS. B.subtilis Sav-10RS HVO, B.subtilis Sav-10RS HV1, B.subWs Sav-10RS 
HV2 and B. subtilis Sav-10RS HV3. 



Example 2 

Fermentations for the production of the tail-variant enzymes of the invention were 
performed on a rotary shaking table in 500 ml baffled Erlenmeyer flasks each containing 100 ml 
TY supplemented with 6 mg/l chloramphenicol. 

Six Erlenmeyer flasks for each of the five B. subtilis strains from example 1 were 
fermented in parallel. Two of the six Erlenmeyer flasks were incubated at 37°C (250 rpm), two at 
30»C (250 rpm), and the last two at 26°C (250 rpm). A sample was taken from each shake flask at 
day 1, 2 and 3 and analyzed for proteolytic activity. The results are shown in tables 1-3. As it can 
be seen from tables 1 -3, the effect of the tails is a surprisingly high improvement on the 
expression level of the protease, as measured by activity in the culture broth. The effect is most 
pronounced at 26°C and 30°C, but is also evident at 37'C as an effect observed especially at the 
early stage of the fermentation. 



Table 1: Relative proteolytic activities at 37°c. 





Day 1 


Day 2 


Day 3 


Sav-10RS 


1.0 


1.0 


1.0 


Sav-10RS HVO 


3.3 


0.7 


0.6 


Sav-10RS HV1 


4.7 


1.3 


1.2 


Sav-10RS HV2 


- 2,2 


0.6 


0.4 


Sav-IORS HV3 


5.3 


1-4 


1.7 
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Table 2: Relative proteolytic activities at 30*C. 





Day 1 


Day 2 


Haw O 
way «3 


Sav-IORS 


1.0 


1.0 


1,0 


Sav-10RS HVO 


1.7 


2.2 


2,9 . 


Sav-10RS HV1 


4.6 


3.1 


4.9 


Sav-10RS HV2 


2.4 


1.9 


2.3 


Sav-10RS HV3 


4.8 


3.0 


4 r 4 


Table 3: Relative proteolytic activities at 26°c. 






Day 1 


Day 2 


Day 3 


Sav-10RS 


1.0 


1.0 


1.0 


Sav-IORS HVO 


1.8 


2,5 


3.1 


Sav-10RS HV1 


2.5 


3.6 


4.3 


Sav-IORS HV2 




2.6 


2.8 


Sav-10RS HV3 


2.6 


3,5 


4,6 



10 



15 



20 



25 



Example 3 

The following construct was used for the chromosomal integration of the tail-variant 
encoding genes. The coding sequence of the well-known subtilisin BPN' protease was 
operationally linked to a triple promoter, a marker gene was fused to this (a spectinomycin 
resistance gene surrounded by resolvase res-sites), and pectate lyase encoding genes from 
Bacillus subblis were fused to the construct as flanking segments comprising the 5' polynucleotide 
region upstream [ymi^ytmC-yfmB-yfrnA-Pel-start], and the 3' polynucleotide region downstream 
[Pel-end-yflS-citS(start)] of the tail-variant encoding polynucleotide, respectively. The integrational 
cassette was made by the joining of several different PCR fragments. After the final PCR reaction 
the PCR product was used for transformation of naturally competent B. subtilis cells. One clone 
denoted PL3598-37 was selected and confirmed by sequencing to contain the correct construct 

The PL3598-37 clone thus contains the following: 

1. The flanking regions 100% homologous to region of the B.subtilis genome (appears as the 
upstream fragment yfmD-ytmC-yfmB-yfmA-Pelstart and the downstream fragment Pel-end- 
yflS-citS(start)). 

2. The Spectinomycin resistance gene flanked by Resolvase sites (res). 

3. The triple promoter region plus CrylllA mRNA stabilising leader sequence. 

4. The BPN' Open Reading Frame. 

Construction of triple pro moter BPNI' rassptte 
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A PCR fragment comprising the integrational cassette for a BPN' library was 
constructed, thus operably linking a triple promoter (as described in WO 99/43835; Novozymes) to 
a BPN' expression cassette from a Bacillus strain. The triple promoter is a fusion of an optimized 
Bacllus amyL-derived promoter (as shown in WO 93/10249; Novozymes) with two promoters 
scBAN and crylllA. where the first is a consensus version of the Bacillus amyloliquefaciens 
amylase BAN promoter, and the latter includes a mRNA-stabilising sequence (as described in WO 
99/43835; Novozymes). Suitable primers can be derived from the publicly available sequences 
(Vasantha, N. et al. Genes for alkaline protease and neutral protease from Bacillus 
amyloliquefaciens contain a large open reading frame between the regions coding for signal 
sequence and mature protein. J. Bacteriol. 159:811 (1984) EMBL: accession No. K02496). A Kpnl 
and a Sail restriction site was introduced to flank the PCR fragment at each end, using the 
primers: 




#252639 (SEQ ID NO: 11): _ v _ 

#251992 (SEQ ID NO: 12): catgtgcatgtggTcgac^^ 

The Kpnl and Sail restriction sites in the PCR fragment were subsequently used to clone 
the fragment into a Kpnl-Sall digested Peel-Spec PCR fragment The Peel-Spec fragment 
compnses a Spectinomycin resistance gene inserted in the middle of the B.subtilis Pectate lyase 
gene plus approx. 2.3. kb of upstream genomic DNA and approx. 1.7 kb downstream genomic 
DNA The Peel-Spec fragment was produced by PCR amplification of genomic DNA from the 
B.subtilis strain MB1053, using the primers: 

#179541 (SEQ ID NO: 13): gcgttgagacgcgcggccgcgagcgccgtttggctgaatgatac 
#179542 (SEQ ID NO: 14): gcgttgagacagctcgagcagggaaaaatggaaccgctttttc 

Construction of MB1053 

The MB1053 B.subtilis strain was constructed by deletion of the pectatelyase (Pel) gene 
through integration of a PCR product into a wild-type B.subtilis typestrain genome. This was 
achteved by a PCR amplification of genomic DNA directly downstream and upstream of the 
Pectate lyase gene of the B.subtilis. 

The ends of the genomic DNA directly preceding and proceeding the Pel gene were 
elongated through primer insertion of sequences being 100% homologous to DNA sequences 
defined by the ends of a third PCR fragment encoding a marker gene surrounded by Resolvase 
(Res) s.tes. In th.s particular case the marker gene (Spec) conferred resistance to spectinomycin. 
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and it was situated between two Res sites, altogether present on the plasmid pSJ3358 (described 
In US patent No. 5.882,888). Three different PCR fragments were initially produced. 

Fragment 1: this fragment covers from the yfmD gene to the middle of the Pel gene and 
introduces an overhang to the Res-Spec-Res cassette at the Pel gene. The size of fragment 1 is 
2.8 kb. The fragment was produced by a PCR amplification chromosomal DNA from the B.subtilis 
strain PL2306, using the primers: 

#1 79541 (SEQ ID NO: 13), and 

#179539 with overlap to #179154 Spec primer (SEQ ID NO: 15): 

tec 



Fragment 2: this fragment covers from the middle of the Pel gene to after the end of the CitS gene 
and introducing an overhang to the Res-Spec-Res cassette at the middle of the Pel gene. The 
size of fragment 2 is 2.3 kb. The fragment was produced by a PCR amplification of chromosomal 
DNA from the B.subtilis strain PL2306, using the primers: 

#179542 (SEQ ID NO: 14), and 

#179540 with overlap to #179153 Spec primer (SEQ ID NO: 16): 

ggatccagatctggtacccgggtctagagtcgacgcggcggttcgcgtccggacagcaca 

Fragment 3: this fragment contains the Spectinomycin gene surrounded by Res sites and DNA 
sequences in the ends overlapping with PCR fragment 1 and 2. The size of fragment 3 is 1.6 kb. 
Fragment 3 was produced by PCR amplification of plasmid pSJ3358, using the primers: 

#179154 (SEQ ID NO: 17): gttgtaaaacgacggccagtgaattctgatcaaatgg 
#179153 (SEQ ID NO: 18): ccgcgtcgacactagacacgggtacctgatctagatc 

Standard conditions for the PCR reaction 

For the PCR amplifications of fragment 1-3 the HiFi Expand™ PCR system (Roche) was 
used together with the following cycling scheme: 

5 pi Buffer 2 

14 pi dNTP's (1.25 mM each) 
2.5 ud 20 pM primer 1 
2.5 pi 20pM primer 2 
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x til water 

To this mix 3 pi of DNA (apx. 100 ng) and 0.75 pi Enzyme mix (use hot start) is added. 
Total volume is 50 pi. 
The cycling profile is: 
5 1 cycle of 1 20 sec at 94*C 



10 



20 



25 



30 



Break. 



10 cycles of 15 sec at 94'C. 60 sec at 60*C, 240 sec at 72*C. 

20 cycles of 15 sec at 94*C, 60 sec at 60'C, (180 sec at 72'C add 20 sec pr cycle) 

1 cycle 600 sec at 68"C. 



The three PCR fragments were made and joined in later JOINING-PCR reactions. The three PCR 
fragments were single sharp bands and no gel purification was necessary. Only Qiagen™ PCR 
purification was performed prior to the following JOINING-PCR. 
JOINING of fragment 1 + 3 (same procedure for fragment 2 + 3): 
15 5 pi Buffer 2 

8 pi dNTP's (1.25 mM each) 

5.0 pi Fragment 3 

5.0 pi Fragment 1 

9.25 pi water 



1 cycle of 1 20 sec at 94"C. 
Break. Add Enzyme 

10 cycles of 15 sec at 94*C. 60 sec at 60'C, 240 sec at 72'C. 
Break. Add Primers 

15 cycles of 15 sec at 94'C, 60 sec at 60*C,(180 sec at 72'C add 20 sec pr cycle) 
1 cycle 600 sec at 68*0 

After the first cycle at 94«C for 120 sec there is a break, where 0.75 pi Enzyme mix is added. 
Total volume is now 45.0 pi. 

After the initial 10 cycles, there is another break in the cycling and for fragment 1+3: 2.5 pi (20pM 
#1 79541) and 2.5 pi (20 pM #179153) are added and for fragment 2+3: 2.5 pi (20pM #179542) 
and 2.5 pi (20 pM #179154) are added and the cycling is continued for 15 cycles more. 
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The PCR products were then gel purified: The size of fragment 1+3 should be 3.4 kb 
and the size of fragment 2+3 should be 3.4 kb. These two fragments were joined in a last PCR 
reaction (Expand™ long system. Roche): 

5 ul Buffer 1 

14 pi dNTP's (1 .25 mM each) 
5.0 pi Fragment 1+3 
5.0 pi Fragment 2+3 
17.75 pi water 



After the first cycle at 94'C for 120 sec there is a break, where 0.75 pi Enzyme mix is added. 
Total volume is now 45.0 pi. 

After the initial 10 cycles, there is another break in the cycling and 2.5 pi (20pM #179541) and 2.5 
pl (20 pM #1 79542) is added and the cycling is continued for 1 5 cycles more. 
15 1 cycle of 120 sec at 94'C. 

Break. Add Enzyme 

10 cycles of 15 sec at 94*C. 60 sec at 60*C, 240 sec at 68*C. 
Break. Add Primers 

15 cycles of 15 sec at 94'C. 60 sec at 60'C, 180 sec at 68*C add 20 sec pr cycle 
20 1 cycle 600 sec at 68*C. 

The size of the joined PCR fragment is 6.8 kb. This PCR fragment was purified using a 
Qiagen™ PC R purification kit, and 5 pl of the 50 pi eluted DNA was used to transform a standard 
B.subtifis strain. After transformation cells were spread onto LBPG-120pg/ml of spectinomycin. 

25 Next day more than 1 000 colonies were seen. 8 of these were checked using PCR primers from 
last JOINING PCR amplification yielding PCR fragment of 6.8 kb rather than the 5.2 kb expected if 
deletion had not occurred. Furthermore, the pectatelyase activity of the clones was checked with 
the Mancini Immunoassay, which showed no reactivity towards the pectatelyase activity. This 
taken together with the Spec resistance tells us that deletion had occurred. One such clone was 

30 selected and denoted MB1 053. 

Insertion of BPN' expression cass ette adjacent t 0 the res-snares in MB10S3 

The ligation mix of the digested PCR amplified triple promoter BPN' expression cassette 
and the Kpnl-Sal digested Peel-Spec PCR fragment was used as template in a PCR amplification 
35 using the PCR primers #179541 and #179542. This resulted in a PCR fragment of approx. 9 kb 
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wto, was «ed ,o ttansterm a**m PLieo, (Bderichsen. B e, a,. ,990. Coning of aldB which 
ZZZT ^ d ""*"*-* - ««»»» Mta Previa. J Bacterid T 

a 1^,? TT ™* taB *™ d *" S ^ " LB - 12 ° Spacing 

pnmers »i /-9541 (SEQ ID NO: 1 3) and #1 79542 (SEQ ID NO- 14) 

into .ha Z^T " % 9 * fra9me '" indiCateS *■' PCR *"•"«"« has >•« '"^ad 
Z^JTj^Zr™' "T" 1 * — — «~ to confirm ^ "„ ». 
^ expmss,on cassette, one such clone was selected and denoted PL3598-37. 

Example 4 

An IntaonT ^ t^™™*™" ""^""al cassette for e .ibraty may be consttucted .„ vivo 

is tzsr rapawe on,y * rep,ica,ion h *-* - » *— * - *— • 

0 The DNA sequence encoding the Pre^rc-domains of the aubtiltsin protease 
commonly known as Savinase, preceded by and operabiy iinked to 

case J'llcZZr 3 ^ """" ^ * 

20 i "> ama *ar 8 ene(achloramphenlcolreslstancegene) and 

"I*"*** "P*ream of the polynucteottde (yfrnD-ytntC-yfrnB-yfrnA-Pa^rtj and 

reUC °' " * ~ J"£ 

5 and PCR?* T T ma " B * ^ C ' 0nln9 aepa digestion of pUC19 ptaamid 

and PCR fragments with appropriate resthctton endonuciaaaa sites of several differed PCR 
fragmente ,„ me generally used piasmid pUC,9. Attar each tigation of a PCR frZTiT^ 
Piasmid. the iigaaon mWure was turned int o electtocompeten, DHsZ 

tet^ 1 * SUPP " er - 006 ^ p,aSmW «» by sequencing 

te contain the correct consttuct as outtined above, and | was denoted pMBtSoe. 

The PMB1508 piasmid thus contains the following: 

i) The C/V///A mRNA stabilising leader sequence includina a riho.™.. „• „• 
: (RBS), operationally linked to M "^'"9 a nbosome b.nding sequence 
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H) DMA encoding the Pre-Pro-domains of the subtilisin commonly known as Savinase 
mcludmg Kpnl and Notl sites for cloning; as cavinase, 

iii) The chloramphenicol resistance operon; 

homo. h \ 7 !T 3 ' dOWnStream flankln9 re 9 ion fPel-end-yflS-citS(start)J which is 99-100% 
homologous to the region of the B.subtilis. 

372*, f 7 r r mentS " Sted ^ C ' 0ned h the PUC19 Vector <W— ^m E.co// ATCC 

rnt^oln ri 9 ^ P ' aSmidS ' an M ^m P 7^erived system for insertion 

mutagenesis and sequencing with synthetic universal primers. Gene 19: 259-2S8, 1982 ) in the 

TZ T'° ^ PMB15 ° 8 - ' n ° rderfor *• reSU,ti " 9 P,asmid to jnte ^ Gently to 

a pecfied s ,te of th B.suMis genome, a new strain was established. The new strain is a 

d^-ofBaaHussubtaislBSBGSCacoesaion number 1A1 168 trpC2 . The strain was made 
competent and transformed as described above. Using elements from the PL3598-37 clone 

toTllfn* *• inte9rati ° n ^ den ° ted MB151 ° Was es ^hed and characterised 
to contain the following elements from PL3598-37: 

i) The triple promoter and the mRNA stabilising element. 

vtmC-vfrn* rTp 9 , 7^ C ° mPriSin9 ^ f0 " OWin9 h0 ™ to °™ Polynucleotide region ly fmD- 
ytmC^yfrnB-yfmA-Pel-start! upstream of the triple-promoter, and the polynucleotide regionW 

end-ynS-citS(start)]downstreamofthemRNAstabi«izinge.ement. 

denv r T T ^ ^ MB151 ° CeHS ' " IS P0ssib,e for the PMB1508 (or 

der.at.ves thereof) to directly integrate into the genome of MB1510 where the two flanking 

eg.ons in fus,on with the triple-promoter and mRNA stabilising element is .ocated. resulting in a 
construction where the incoming PrePro encoding DNA of P MB1508DNA has been integral in 
the correct reading frame with the tripei-promote, the mRNA stabilising eiement and the RBS 
Thus resulting ,„ nigh expression of the integrated gene from the promoter elements already 
present on the genome of MB1510. 

who P 1™*°™**™ effeCienc y was es ^^hed for the B.suMis strain MB1510 transformed 
w^ Sco,' prepared plasmid P MB1508. For farther testing of the potential of using this approach, 
the Savmase encod.ng gene of BeolUus dausN was PCR ampIjfied usjng ^ ^ pcR ^ 

Primer #317 (SEQ ID NO: 19) tggcgcaatcggtaccatgggg 

Primer #1 39 Notl (SEQ , D NO: 20) catgtgcatgcggccgcattaacgcgttgccgcttctgcg 
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The resulting -0.8 kb of the Savinase fragment and the pMB1508 plasmid are digested 
with Kpnl and Natl, and the resulting fragments are then purifiied by agarose gel electrophoresis 
The two fragments are ligated, and the ligation mixture is used to transform competent E. coli cells 
which are then plated on LB-agar plates or placed in liquid media for growth overnight at 37'C- 
5 both types of media containing 50-100ug/ml of Ampicillin. After incubation, a plasmid prep is made 
of the liquid culture. The purified plasmid is used for transformation of competent cells of MB1510 
(us.ng 100-10.000 ng of plasmid per transformation. The transformed cells are plated onto TY 
medium with 2% skimmilk and 6 ug/ml of chloramphenicol for selection. After overnight incubation 
at 37-C clearing zones appear around those colonies wherein the integration cassette is 
10 integrated properly into the cells, indicating high Savinase expression. 

This approach can also be used to make highly diverse libraries of any gene of interest 
expressable in B.subtiiis. where rather than a gene encoding one enzyme, any expressable 
polynucleotide is inserted into the plasmid pMB1508 and integrated into the MB1510 strain for 
subsequent screening. 
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Sequence of p lasmid DMB1508 rSEQ ID NO: 21) 

The plasmid pMB1508 has the following components, indicated by basepair positions: 

BP 5186-395: pUC19 sequence from E.co// clone ATCC 37254, Vieira J. Messing J. The 
PUC plasmids. an M13mp7-derived system for insertion mutagenesis and sequencing with 
20 synthetic universal primers. Gene 19: 259-268, 1982. 

BP 396-1021: EcoR I cloning site (BP396-401) and the CrylllA mRNA stabilising 
element. (Described in WO 9634963-A1) 

BP 1022-1412: Encodes the Pre-Pro sequence of Savinase and the Atofl cloning site 
(Pre-Pro part described in eg. WO 9623073-A1, the A/ofl site and the spacing between the Pre- 
25 Pro and A/ofl was introduced by the PCR primer. 

BP 1413-2512: The 8g/ II cloning site (BP1413-1418) and the Chloramphenicol acetyl- 
transferase operon of pDN 1050 (Described in eg. Diderichsen.B.; Poulsen,G.B, Joergensen.S.T.; 
A useful cloning vector for Bacillus subtilis. Plasmid 30:312 (1993)). 

BP 2513-5185: The polynucleotide region [Pel-end-yflS-citS(start)l downstream of the 
pelB locus of the B.subtilis genome, (as it appeaars from the publication and corresponding 
database of: F. Kunst. N. Ogasawara, I. Moszer, <146 other authors>, H. Yoshikawa, A Danchin. 
'The complete genome sequence of the Gram-positive bacterium Bacillus subtilis" 
Nature (1 997) 390:249-256). 

35 The Bacillus subtilis strain MB151Q 



30 
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MB1510 has the following specific features in and around the pelB locus: 

i) The triple promoter and the mRNA stabilising element including a RBS (Ribosome binding 
sequence). 

ID Flanking segments comprising the following homologous polynucleotide region {yfmD-ytmC- 
yfmB-yfmA-Pel-start] upstream of the triple-promoter, and the polynucleotide region [Pel-end-yflS- 
citS(start)] downstream of the mRNA stabilizing sequence. 

Sequence of MB1510 genomic integration region reFQ in Kin- oof 

BP 1-2873: corresponds to sequence of Bacillus subtilis genome yfmD-ytmC-yfmB- 
yfmA-Pel-start (as it appeaars from the publication and corresponding database of: F. Kunst et al 
"The complete genome sequence of the Gram-positive bacterium Bacillus subtilis" 
Nature (1997) 390:249-256). 

BP 3102-4082: The triple promoter and CrylllA mRNA stabilising element plus RBS 
(Described above in PL3598-37 construct). 

BP 4083-5718: The polynucleotide region [Pel-end-yflS-citS(start)] end of and 
downstream of the pelB locus of the B.subtilis genome (as it appeaars from the publication and 
corresponding database of: F. Kunst, N. Ogasawara, I. Moszer, <146 other authors> H 
Yosh,kawa. A. Danchin. "The complete genome sequence of the Gram-positive bacterium Bacillus 
subtilis" Nature (1997) 390:249-256). 



Example 5. 

Another tail-variant library was constructed. In this library two amino acids were 
introduced at the C-termina. of the 10R protein. Such a TaiHibrary may be made with the method 
described above using the following PCR primers in a PGR reaction using genomic DNA from 
25 B.subtilis 1 0RS as template: 

1605 (SEQ ID NO: 23): gacggccagtgaattcgataaaagtgc 

1 606 (SEQ ID NO: 24): ccagatctctatnktnktgtacggagtctaactccccaagag 

30 wherein N= A, C, GorT; andK = TorG. 

The resulting PCR product was digested with EcoR | and Bgl II and ligated into EcoR I 
and Bgl II digested pMB1508. Hereafter following the principle described above. 
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Chloramphenicol resistant Sac///us subtffis transformants were picked by a robotic 
colony picker from a bioassay plate and transferred into a 384 well microtiter plate (MTP) 
containing 0.05 X TY supplemented with 6 mg/l chloramphenicol (60ul/well). The MTPs were 
incubated at 26°C for 72h. After incubation each well was analyzed for proteolytic activity. 

The thirty Sac///us subtilis transformants with highest proteolytic activity were selected 
for determination of the two tail amino acids in each transformant by DNA sequencing, the 
sequencing results are summaries in table 4 and table 5. 



10 



AA Tail 


No. of transformants 


TL 


4 


TT 


4 


QL 


3 


TP 


3 


LP 


3 


Tl 


2 


IQ 


2 


QP 


2 


PI 


2 


LT 




TQ 




IT 




QQ 




PQ 




Total 


30 



Table 4: column one shows the amino acid sequence of the tail, and column two shows the 
number of Bacillus subtilis transformants sequenced with that particular AA tail sequence. 



Possibilities position 1 


Result 


Possibilities position 2 


Result 


K 


0 


K 


0 


R 


0 


R 


0 
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T 


14 


T 


6 


1 


3 


I 


4 


Q 


6 


Q 


5 


P 


3 


P 


8 


L 


4 


L 


7 


Total 


30 


Total 


30 



Table 5: The table shows the amino acid which could be introduced by the primer used for the 
l.brary construct and the actual findings by DNA sequencing of the thirty colonies isolated from 
screening. 
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Example 6 

Construction of Bacill us subtilis strains L? I 9 HVO L2 HV1 

A Bacillus subtilis strain was made analogously with the construction of the Bacillus 
subtilis strain 10RS, with the DNA coding for the pro-form of the S2A protease from Nocartiopsis 
dassonvittei subsp. Dassonvillei DSM 43235. denoted L2. fused by PGR in frame to the DNA 
cod>ng for the signa. peptide from SAVINASE™ ( a well-known commercial protease derived from 
Baallus clausii, available from Novozymes. Denmark), the resulting strain was denoted BaciHus 
subtilis Sav-L2. 

The DNA sequence including the coding region for the pro-mature S2A protease from 
Nocartopsis dassonvillei subsp. Dassonvillel DSM 43235. as amplified with primers 1423 and 
1475. Is shown in SEQ ,D NO: 25. The corresponding encoded pro-form amino acid sequence for 
the L2 protease is shown in SEQ ID NO: 28. 

1423 (SEQ ID NO: 26): gcttttagttcatcgatcgcatcggctgctccggcccccgtcccccag 
1475 (SEQ ID NO: 27): ggagcggattgaacatgcgattaggtccggatcctgacaccccag 

Two tail-variants of this construct were also made. Tail variant Sav-L2 HVO was 
constructed to have 8 amino acids extra in the C-terminus: QSHVQSAP (SEQ ID NO 3) by using 
the DNA sequence extension inserted in front of the TAA stopcodon which is shown in SEQ ID 
NO: 4. Tail variant Sav-L2 HV1 was constructed to have 4 amino acids extra in the C-terminus- 
QSAP (SEQ ID NO: 5). by using the DNA sequence extension inserted in front of the TAA 
stopcodon which is shown in SEQ ID NO: 6. Both tail variants had the SAVINASE™ signal- 
pept.de encoding sequence fused in frame with the pro-mature encoding sequence just like in 
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The Sav-L2 gene and the two tail-variants Sav-L2 HVO and Sav-L2 HV1 were integrated 
by homologous recombination on the Bacillus aubtilia MB1053 host cell genome as outlined 
above. Chloramphenico. resistant transformants were checked for protease activity on 1% skim 
m* LB-PG agar plates (supplemented with 6 ug/ml chloramphenicol). Some protease positive 
colon.es were further analyzed by DNA sequencing of the insert to confirm the correct DNA 
sequence, and one strain for each construct was selected and denoted B.aubtilis Sav-L2 
B.aubtHia Sav-L2 HVO. and B.aubtilis Sav-L2 HV1. respectively. 

Example 7. 

The three B. aubtilia strains of example 6. were fermented on a rotary shaking table in 
500 ml baffled Erlenmeyer flasks containing 100 ml TY supplemented with 6 mg/l 
chloramphenicol. Six Erlenmeyer flasks for each of the three 8. aubtilia strains were fermented in 
parallel. Two of the six Erlenmeyer flasks were incubated at STC (250 rpm). two at 30°C (250 
rpm). and the last two at 26»C (250 rpm). A sample was taken from each shake flask at day 1 2 
and 3 and analyzed for proteolytic activity. The results are shown in tables 6-8. As it can be seen 
from tables 6-8. the effect of the tails also increases the expression level for the Sav-L2 protease 
from Nocandiopaia daaaonvittei subsp. DaaaonviM DSM 43235 when expressed in B. aubtilia An 
-ncrease of up to 40% is observed in this experiment, but overall improvement is observed for both 
tail-variants at all three temperatures tested. 

Table 6. Relative proteolytic activities at 37°C. 





Day 1 


Day 2 


Day 3 


Sav-L2 


1.0 


. to 


1.0 


Sav-L2 HV1 


1.4 


1.3 


1,2 


Sav-L2 HVO 


1.3 


V . 





Table 7. Relative proteolytic activities at zo'c. 





Day1 


Day 2 


Day 3 


Sav-L2 


1,0 


1.0 


1.0 


Sav-L2HV1 


1.0 


1.2 


1.4 • 


Sav-L2 HVO 


1.1 


._ 1,3 


1,3 


Table 8. Relative prot< 


solytic acti 
Day 1 


vities at 2< 
Day 2 


3°C. 
Day 3 


Sav-L2 


1.0 


1.0 


1.0 


Sav-L2 HV1 


1.3 


1.1 


- 1.1 . 


Sav-L2 HVO 


0.2 


1,1 


1.1 



Example 8 
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*.« •„ ° NA ! e<|Uence ^ P^X" ftom the L2 protease from Noca«**s* 

subsp. DassomrfBei, DSM 43235 b sh™ i„ SEQ ,D NO: 29. and me oorresporlg 
am no aad sequence is snown ,„ SEQ |D N0; ^ A ^ ^ P 9 

to the Bac*s suMtfs * , ORS, but with the DNA coding for the pre-region of the L2 re^g 
.he pm-regton of 10RS. was made. The e^re L210R protease encoding sequence inc. L Z 
region of L2. is shown In SEQ ID NO: 31 . ^ 

Two fail variants of the abo»e construct were also made. Tail variant HVO was 
^structed fo ha»e 8 amino acids extra in me Werminus: QSHVQSAP (SEQ ID NO: 3) with the 

r^r™ 10 N ? 4 insaied in m * ™ s,opMdon - - ~ 

m ^ t ^ rT ^ 10 4 amin0 aCldS *• QSAP (SEQ ID 

NO. 5) me DNA sequence shown in SEO ,D NO: 6 mserted in from of me TAA stopccjn Z 
the encoding sequence. ^ " OT 

The 10RL2) construct and the two tail variants were integrated by homologous 
recomb.nat.on on the Barf,** subWs MB1053 host cel, genome. ChLamphenico re ^n 
ransformants were checked for protease acuity on 1 % skim mi* LB-PG agar Ze 

XZ D jr 6 ^ ^ S - ~ — colonies wfre fu' he 

ana^ed by DNA sequencing of the insert to confirm the comact DNA sequence, and a strain for 

Example 9 

The six B. sum, strains 10RS. 10RS HVO. 10RS HV1. L210R. L2 10 R HVO. and L210R 
HVV w*e emtented on a rotary shaking tabte In 500 m! baflled Ertenreeyer flasks containing 100 
ml TV suppiemented wflh 6 mgfl cnloremphenice, Six Erieomeyer flasks tar each of me B. 
strains we re Wanted in parallel. Two of me six Erienmeyer flasks were incubaed a. 37X (250 

si elk afd" 0 f ,° 7; ^ ** "° " (25 ° ""* A - — ^ 

1. «* ,n tables g-11. As a can be seen from me resufls. me effect of me exchange of the 
Proreg,on from 10R with me proregion from me L2 protease resulted in e surprising 2 
"hprevemem on me express*. ,eva, * the 10R pretease as measured by protect 7Z£, 
me cuiture brom at 3rc. The effect is most prenounced in me two tail variants 

Table 9. Relative proteolytic activities at 37°c. 





Day 1 


Day 2 


Day 3 


10RS 




1.0 


1,0 


10RS HVO 


3.7 


8.9 


3.5 
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3.9 


8.5 


4.3 




1.9 


2.3 


1.6 


I i_l\/n 


5.3 


14.4 


7.3 


L210R HV1 


9.1 


20.9 


7.6 


Table 10. Relative prote< 


alytic activ 
Day 1 


ities at 30* 
Day 2 


Day 3 


10RS 


1.0 


1.0 


_ 1.0 


10RS HVO 


2,8 


3,1.. 


4.3 


10RS HV1 


3.6 


- 3 r 6 _ 


4.9 


L210R 


0.6 


... M .. 


0,9 


L210R HVO 


3.5 


3.2 


4.5 


L210R HV1 


3,7 


3.2 


4.5 


Table 11. Relative proteolytic activities at 26" 


C. 




Day 1 


Day 2 


Day 3 


10RS 


1.0 


1.0 


. 1.0 


10RS HVO 


2,6 


3,0 


2,8 


10RS HV1 


3.7 


3.3 


3.1 


L210R 


0.4 


0.7 


0.4 


U10RHV0 


2.3 


2,1 


1.9 


L210R HV1 


2.2 


V _ 


1,7 



Example 10 

Completely analogously with the above examples 1 through 9, similar experiments are 
earned out with the proteases of the following Nocardiopsis strains: 

(a) Nocardiopsis dassonvillei NRRL 18133 as described in WO 88/03947; 

(b) Nocardiopsis sp. NRRL 18262 as described in WO 88/03947. the DNA and amino acid 
sequences of the protease derived from Nocardiopsis sp. NRRL 18262 are shown in DK 
patent application no. 1996 00013, and WO 01/58276 describes the use in animal feed of 
acd-stable proteases related to the protease derived from Nocardiopsis sp. NRRL no. 18262- 

(c) Nocardiopsis Alba DSM 15647; the amino acid sequence of the protease is SEQ ID NO: 34 
the encoding nucleotide sequence is SEQ ID NO: 33; the gene is isolated from the genomic 
DNA of this strain by PCR-amplification using the two primers: 

1421 (SEQ ID NO: 35): gftcatcgatcgcatcggctgcgaccggccccctcccccagtc 
1 604 (SEQ ID NO: 36): gcggatcctatcaggtgcgcagggtcagacc. 

(d) Nocardiopsis prasina DSM 15648; the amino acid sequence of the protease is SEQ ID NO: 
38, the encoding nucleotide sequence is SEQ ID NO: 37; the gene is isolated from the 
genomic DNA of this strain by PCR-amplification using the two primers: 

1346 (SEQ ID NO: 39): gltcatcgatcgcatcggctgccaccggaccgctcccccagtc 
1602 (SEQ ID NO: 40): gcggatcctattaggtccggagacggacgccccaggag. 



10423.000-DK 



5 



58 

(e) Nocardiopsis prasina DSM 15649; the amino acid sequence of the protease is SEQ ID NO: 
42, the encoding nucleotide sequence is SEQ ID NO: 41; the gene is isolated from the 
genomic DNA of this strain by PCR-amplification using the two primers: 

1603 (SEQ ID NO: 43): gttcatcgatcgcatcggctgccaccggaccactcccccagtc, and 1602 (SEQ ID 
NO: 40). 
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Example 11 

The performance of the Nocardiopsis dassonvillei subspecies dassonvillei DSM 43235 
protease assayed in a monogastric in vitro digestion model. The performance of a purified 
preparation of the mature part of the protease having SEQ ID NO: 28 (prepared as described 
above) was tested in an in vitro model simulating the digestion in monogastric animals In 
particular, the protease was tested for its ability to improve solubilisation and digestion of maize/- 
SBM (maizeA-soybean meal) proteins. In the tables below, this protease is designated "protease of 
the invention." 

The in vitro system consisted of 15 flasks in which maizeASBM substrate was initially 
incubated with HCI/pepsin - simulating gastric digestion - and subsequently with pancreatin - 
emulating intestinal digestion. 10 of the flasks were dosed with the protease at the start of the 
gastric phase whereas the remaining flasks served as blanks. At the end of the intestinal 
•ncubation phase samples of in vitro digesta were removed and analysed for solubflised and 
digested protein. 



Table 12: Outline of in vitro digestion procedure 



Components added 


PH 


Temperature 


Time 
course 


Simulated digestion 
phase 


10 g maizeASBM substrate 
(6:4), 41 ml HCI (0.1 05M) 


3.0 


40°C 


t=0 min 


Mixing 


5 m! HCI (0.1 05M) / pepsin 
(3000 U/g substrate), 1 mL 
protease of the invention 


3.0 


40°C 


t=30 min 


Gastric digestion 


16mJH 2 0 


3.0 


40°C 


t= 1 .0 hour 


Gastric digestion 


7 ml NaOH (0.39M) 


6.8 


40°C 


t=1.5hours 


Intestinal digestion 


~5 mi NaHC0 3 (1M) T 
pancreatin (8mg/gdiet) 


6.8 


40°C 


t=2.0 hours 


Intestinal digestion 


Terminate incubation 


7.0 


40°C 


t=6.Q hours J 
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Conditions 

Substrate: 

PH: 

HCI: 

pepsin: 

pancreatin: 



temperature: 40°C. 
Replicates: 5 
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4 g SBM, 6 g maize (premixed) 

3.0 stomach step/ 6.8-7.0 intestinal step 

0.105 M for 1.5 hours (i.e. 30 min HCI-substrate premixing) 

3000 U /g diet for 1 hour 

8 mg/g diet for 4 hours 



10 Solutions 

0.39 M NaOH 
0.105 M HCI 

0.105 M HCI containing 6000 U pepsin per 5 ml 
1 M NaHC0 3 containing 16 mg pancreatin per ml 
15 125 mM NaAc-buffer, pH 6.0 



20 



25 



30 



35 



Enzyme protein determinations 

The amount of protease enzyme protein Cm what follows, Enzyme Protein is abbreviated 
EP) is calculated on the basis of the A 280 values and the amino acid sequences (amino acid 
compositions) using the principles outlined in S.C.Gill & P.H. von Hippel, Analytical Biochemistry 
182,319-326, (1989). 

Experimental procedu re for in vitm nrf oHd 

The experimental procedure was according to the above outline. pH was measured at time 
1, 2.5, and 5.5 hours. Incubations were terminated after 6 hours and samples of 30 ml were 
removed and placed on ice before centrifugation (10000 x g, 10 min, 4*C). Supematants were 
removed and stored at -20°C. 

Analysis 

All samples were analysed for % degree of protein with the OPA method as well as content 
of solubilised and digested protein using gel filtration. 

DH deter mination bv the OPA-mgthnri 

The Degree of Hydrolysis (DH) of protein in different samples was determined using an 
semi-automated microliter plate based colorimetric method (Nielsen.P.M.; Petersen.D.; 
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2001 66. 642-646). The OPA reagen, was pmpa^ 33 fo,,^. 7 620 Wraborate 

w2T Md T m9 sodiumdodecyl au,phate (SDS) — *— ■»-« 

^opa, T T C ° mPle,ely C ° n8nuln9 - 160 c-phmal-dialdehyde 

1 L 4 e,han °'- ^ ° PA — *— «" O-nfltoflvely to ft. 

meqv/i) was praparad by solubiiising 50 mg seri™ (Marat, Germany, ,„ 500 ml detoniad water. 

aho,* n?r*! SO " ,80n WM P " Pared "* dilulin9 "* sam ^ to absence (280 nm) of 
about WO-—* supematants ^ diWed (100 us|ng an automated ^ 

Wtmnedorf, SvnUenand). All other speotrophetometer readings were performed a, 340 nCno 

TopI "T" ^ inS8,ted ^ IEMS MF ™— ■ ""'and, andX 

o, OPA reagent was automaflcaily dispensed. P.ates were shaken (2 mln; 700 rpm, btf ore 

Zd oT ^ was calculated. Eighnold determinate. Zn^Z 



Estimation of sgjjjbJl&jd and rij^t^f 

,n .• JOl C ° ntenl * SO ' Ubi " Sed Pr0tein " patents from In *bo digested samples was 

ffltered through 0.46 pm poVoarbonate Alters and diluted 0:50, v/v> w^th Hs o. Muted samp.es 
were ohromatographed by HPLC using a Supertax Peptide PE (7.5 x 300 mm) 9 e! 

Gtobai, The eluen, used tor iso^afle eiuflon was 50 mM sodium phosphate buT^ 
7.0, conta^g ,50 mM NaCl. The tote, volume of eluen, per run was 26 m, and I flow rZ^, 

*L * h T ™° ^ 31 214 " m - *• totel ^ «— - P™«-aa Z 

(R -0.9993) was made flam a dMion series of an m v*o digested referenda maizer-SBM sample 
«. known tote, protein oontent The protein determined in mis reference sample wa" cant 

3 0 zzz rrsrsr- - % ~ aoac - <19M> ^ 

The oontent of digested protein was estimated by integrating me ohramatogram araa 
conaspondng ,0 papfldea and ammo adds having a motec.ar mass o, ,500 Daflon J bZ 
<Savo,a.L : GauOver.S.F. Dialysis C rt Par The M, Measurement or Pratein D^bi^T 

35 for Prad^on o, The Digestibte Crade Pratein Content in P,g Fee ds. J. s=i. Food Agr. ,990 » 
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173-178; Boisen,S.; Eggum.B.O. Critical Evaluation of In-vitro Methods for Estimating Digestibility 
in Simple-Stomach Animals. Nutrition Research Reviews 1991, 4. 141-162). To determine the 
1500 Dalton dividing line, the gel filtration column was calibrated using cytochrome C (Boehringer, 
Germany), aprotinin, gastrin I, and substance P (Sigma Aldrich, USA), as molecular mass 
5 standards. 
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Results 



The results shown in Tables 13 and 14 below indicate that the protease increased the 
Degree of Hydrolysis (DH). as well as soluble and digestible protein significantly. 

Table 13: Deoree of Hvdrnlysis (PHY ahsnlute and relative v a l,.» e 



Enzyme 

(dosage In mg EP/kg 
feed) 
Blank 



Protease of the invention 
(100) 



n 



Of total protein 



%DH 



26.84 



28.21 



SD 



0.69 



0.35 



Relative to blank 



%DH 



100.0 



105.1 



%cv 



2.57 



1.25 



Different letters within the same column indicate significant differences (1-way ANOVA. Tukey- 
Kramer test, P<0.05). SD = Standard Deviation. %CV= Coefficient of Variance = (SD/mean value) 
x 100% 



Table 14: Solubilised and digested crud^ protein measured bv Akta HPt r. 



Enzyme 
(dosage in 
mg EP/kg 
feed) 



Blank 



Protease of 
the invention 
(50) 



(100) 



n 



Of total protein 



%dig. 
CP 



54.1 



57.7 



58.9 



SD 



1.1 



1.1 



0.8 



%sol.CP 



90.1 



93.2 



94.8 



SD 



1.1 



1.4 



0.9 



Relative to blank 



%dig.CP 



100.0 



106.7 



108.9 



CV% 



2.0 



1.9 



1.3 



%sol.CP 



100.0 



103.4 



105.2 



CV% 



1.2 



1.5 



0.9 



Different letters within the same column indicate sig nificant differences (1-way ANOVA, Tukey- 
Kramer test. P<0.05). SD = Standard Deviation. %CV = Coefficient of Variance = (SD/mean value) 
X100% 
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Example 12 

Performance of the protease from Nocardiopsis dassonvillei subsp. dassonvillei DSM 
43235 in an aquaculture in vitro model The protease preparation as described in Example 3 was 
tested in an aquaculture in vitro model simulating the digestion in coldwater fish. The in vitro 
system consisted of 15 flasks in which SBM substrate was initially incubated with HCI/pepsin - 
simulating gastric digestion - and subsequently with pancreatin - simulating intestinal digestion. 
10 of the flasks were dosed with the protease at the start of the gastric phase whereas the 
remaining 5 flasks served as blanks. At the end of the intestinal incubation phase samples of in 
vitro digesta were removed and analysed for solubilised and digested protein. 



Outline of aqua in vitro digestion procedure 



Components added 



10 g extruded SBM substrate, 
62 mL HCI (0.155M)/pepsin 
(4000 U/g substrate). 1 mL of 
the protease of the invention 



7mL NaOH (1.1M) 



5 mL NaHCOa (1 M) / pancreatin 
(8 mg /g diet) 



Terminate incubation 



PH 



3.0 



6.8 



6.8 

To 



Temperature 



15°C 



15°C 



15°C 



15°C 



Time 
course 



t=0min 



t=6 hours 



t=7 hours 



t=24 
hours 



Simulated digestion 
phase 



Gastric digestion 



Intestinal digestion 



Intestinal digestion 



Conditions 
15 Substrate: 

PH: 

HCI: 

Pepsin: 

Pancreatin: 
20 Temperature: 15 D C 

Replicates: 5 



10 g extruded SBM 

3.0 stomach step/ 6.8-7.0 intestinal step 
0.155 M for 6 hours 
4000 U /g diet for 6 hours 
8 mg/g diet for 1 7 hours 



Solutions 



1.1 M NaOH 
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0. 155 M HCI / pepsin (4000 U/g diet) 
1 M NaHC0 3 containing 1 6 mg pancreatin/mL 
125 mM NaAc-buffer, pH 6.0 

Experimental procedure for a qua in vitm mortal 

The experimental produce was according to the above outline. pH was measured at time 

1. 5, 8 and 23 hours. Incubations were terminated after 24 hours and samples of 30 mL were 
removed and placed on ice before centrrfugation (13000 x g. 10 min, 0'C). Supematants were 
removed and stored at -20°C. 

Analysis 

AH supematants were analysed using the OPA method <% degree of hydrolysis) and by AKTA 
HPLC to determine solubilised and digested protein (see monogastric example). 

Pre-treatment of in vitro sunam atants with t=ASY SPE column* 

Before analysis on AKTA HPLC supematants from the in vitm system were pretreated 
using solid-phase sample purification. This was done to improve the chromatography and thereby 
prevent unstable elution profiles and baselines. The columns used for extraction were solid phase 
extraction columns (Chromabond EASY SPE Columns from Macherey-Nagel). 2 mL milliQ water 
was eluted through the columns by use of a vacuum chamber (vacuum 0.15 x 100 kPa) 
Subsequently 3 mL in vitro sample was dispensed onto the column and eluted (vacuum 0 1 x 100 
kPa), the first % mL of eluted sample was thrown away and a clean tube was placed beneath the 
column, then the rest of the sample was eluted and saved for further dilution. 

25 Results 

The results shown in Tables 15 and 16 below indicate that the protease significantly 
increased Degree of hydrolysis and protein digestibility. 

Table 15: Degree nf Hvrlroly i* -, /PHI k v tne QPA mgfhnH abso | U te anri r«l a th,„ 



15 



20 



Enzyme 

(mg EP/kg diet) 


n 


Of total protein 


Relative to blank 


%DH 




SD 


%DH 




%cv 


Blank 


5 


21.30 


a 


0.52 


100.0 


a 


2.42 


Protease of the invention (50) 


5 


21.98 


b 


0.22 


103.2 


b 


1.00 
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Different letters within the same column indicate significant differences (1-way ANOVA, Tukey- 
Kramer test, PO.05). SD = Standard Deviation. %CV = Coefficient of Variance = (SD/mean value) 
x 100% 



5 Table 16: Solubilised and digested crude protein measured by AKTA HPLC, absolute and relative 
values 



Enzyme 
(mg EP/kg 
diet) 


N 


Of total protein 


Relative to blank 


%CP 
dig 




SD 


%CP 
so! 




SD 


%CP 
dig 




%cv 


%CP 
sol 




%cv 


Blank 


5 


50.0 


a 


2.2 


89.9 


— d 


3.2 


100.0 


a 


4.5. 


100.0 


„ a 


3.5 


Protease of 
the 

invention 
(50) 


5 


52.3 


0 


1.1 


91.4 


a 


1.5 


104.8 


D 


2.1 


101.7 


a 


1.6 


(100) 


5 


53.4 


D 


0.4 


91.6 


a 


1.0 


107.0 


... b 


0.7 


101.9 


a 


1.1 



Kramer test, P<0.05). SD = Standard Deviation. %CV = Coefficient of Variance = (SD/mean value) 
x 100%. 
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CLAIMS 

1. A secreted polypeptide which has alpha-lytic endopeptidase activity, which polypeptide 
comprises at least three non-polar or uncharged polar amino acids within the last four amino acids 
of the C-terminus of the polypeptide, and which polypeptide: 

(a) comprises an amino acid sequence which is at least 70% identical to the amino acid 
sequence of the mature part of the polypeptide shown in SEQ ID NO: 28; SEQ ID NO: 
33: SEQ ID NO: 47; or SEQ ID NO: 41 

(b) comprises an amino acid sequence which is at least 70% identical to the amino acid 
sequence of the mature part of the polypeptide encoded by the polynucleotide in SEQ 
ID NO: 1; SEQ ID NO: 2; SEQ ID NO: 25; SEQ ID NO: 31; SEQ ID NO: 32; SEQ ID 
NO: 36; or SEQ ID NO: 40; 

(c) comprises a mature part which is a variant of the mature part of the polypeptide having 
the amino acid sequence of SEQ ID NO: 28; SEQ ID NO: 33; SEQ ID NO: 37; or SEQ 
ID NO: 41 comprising a substitution, deletion, extension, and/or insertion of one or 

15 more amino acids; 

(d) is an allelic variant of (a), (b). or (c); or 

(e) is a fragment of (a), <b), (c), or (d). 



2. The polypeptide according to claim 1, which is an artificial variant of a wildtype polypeptide said 
variant having one or more amino-acid(s) added to the C-terminus as compared to the wildtype. 

3. The polypeptide according to claim 2. wherein the one or more added amino acid(s) is (are) 
non-polar or uncharged. 

4. The polypeptide according to claim 3, wherein the one or more added amino acid(s) is one or 
more of Q, S, V, A, or P. 



5. The polypeptide according to claim 2. wherein the one or more added amino acids are selected 
from the group consisting of: QSHVQSAP, QSAP. QP. TL, TT, QL, TP. LP. Tl. IQ, QP, PI, LT, TQ. 

30 IT. QQ.andPQ. 

6. The polypeptide according to any of claims 1 - 5 which comprises a heterologous pro-region 
from a different protease; preferably the pro-region is derived from an S2A or S1E protease, and 
most preferably it is at least 70% identical to the pro-region shown in SEQ ID NO: 30. 
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7. The polypeptide according to any of claims 1 - 6 . which comprises a heterologous secretion 
s,gnal-peptide which is cleaved from the polypeptide when the polypeptide is secreted, preferably 
the heterologous secretion signal peptide is derived from a heterologous protease. 

8. The polypeptide according to claim 7. wherein the heterologous secretion signal peptide 
comprises an amino acid sequence having a sequence identity of at least 70% with the amino 
acid sequence encoded by polynucleotides 1 - 81 of SEQ ID NO: 2. 

9. An isolated polynucleotide encoding a polypeptide as defined in any of claims 1-8. 

10. A recombinant expression vector or polynucleotide construct comprising a polynucleotide as 
defined in claim 9. 

11. A recombinant host cell comprising a polynucleotide as defined in claim 9, or an expression 
vector or polynucleotide construct as defined in claim 10. 

12. The recombinant host cell according to claim 1 1 which is a Bacillus ceil. 

13. A transgenic plant, or plant part, comprising a polynucleotide as defined in claim 9, or an 
expression vector or polynucleotide construct as defined in claim 10. 

14. A transgenic, non-human animal, or products, or elements thereof, comprising a 
polynucleotide as defined in claim 9, or an expression vector or polynucleotide construct as 
defined in claim 10. 

15. A method for producing a polypeptide as defined in any of claims 1 - 8, the method 
comprising: (a) cultivating a recombinant host cell as defined in claim 11 or 12, or a transgenic 
Plant or animal as defined in claims 13 or 14, to produce a supernatant comprising the 
polypeptide, and optionally (b) recovering the polypeptide. 

16. An animal feed additive comprising at least one polypeptide as defined in any of claims 1 - 8- 
and 

. (a) at least one fat-soluble vitamin, and/or 

(b) at least one water-soluble vitamin, and/or 

(c) at least one trace mineral. 
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17. An animal feed composition having a crude protein content of 50 to 800 g/kg and comprising 
at least one polypeptide as defined in any of claims 1 - 8, or at least one feed additive of claim 16. 

5 18. A composition comprising at least one polypeptide as defined in any of claims 1 - 8. together 
with at least one other enzyme selected from amongst phytase (EC 3.1.3.8 or 3.1.3.26); xylanase 
(EC 3.2.1.8); galactanase (EC 3.2.1.89); alpha-galactosidase (EC 3.2.1.22); protease (EC 3.4.-.-) 
phospholipase A1 (EC 3.1.1.32); phospholipase A2 (EC 3.1.1.4); lysophospholipase (EC 3.1.1.5)! 
phospholipase C (3.1.4.3); phospholipase D (EC 3.1.4.4); and/or beta-glucanase (EC 3 2 1 4 or 
10 EC 3.2.1.6). 

1 9. A method for using at least one polypeptide as defined in any of claims 1 - 8, for improving the 
nutritional value of an animal feed, for increasing digestible and/or soluble protein in animal diets, 
for increasing the degree of hydrolysis of proteins in animal diets, and/or for the treatment of 

15 vegetable proteins, the method comprising including the polypeptide(s) in animal feed, and/or in a 
composition for use in animal feed. 

20. A method for using at least one polypeptide as defined in any of claims 1 - 8, comprising 
including the polypeptide(s) in a detergent formulation. 

20 
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ABSTRACT 

A secreted proteolytic polypeptide comprising at least three non-polar or uncharged polar 
amino acids within the last four amino acids of the C-terminus of the polypeptide, encoding 
polynucleotides, expression vectors comprising the polynucleotides, host cell comprising the 
5 polynucleotides, methods for producing said polypeptide, and methods for using the polypeptide: 



) 
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SEQUENCE LISTING 

<110> Novozymes A/s 

<120> improved proteases and methods for producing them 
<130> 1042 3. 000-DK 
<160> 42 

<170> Patentin version 3.2 

<210> 1 
<211> 1062 
<212> DNA 

<213> Nocardiopsis sp. nrrl 18262 
<400> 1 

gctactggag cattacctca gtctcctaca cctgaagcag atgcagtatc gatgcaagaa 60 

gcattacaac gtgatcttga tcttacatca gctgaagctg aggaattact tgctgcacaa 120 

gatacagcct ttgaagttga tgaagctgcc gctgaagcag ctggtgatgc atatggtggt 180 

tcagtattcg atactgaatc actcgaactt actgtactag tgaccgatgc agcagctgtt 240 

gaagctgttg aagccacagg tgcaggtaca gagctcgtat cttatggtat tgatggatta 300 

gatgagatcg tacaagagct taatgcagct gatgccgttc caggtgtagt tggatggtat 360 

cctgatgtag caggtgatac tgttgtctta gaagttcttg aaggctctgg agctgatgtt 420 

tctggacttt tagcagacgc aggagtcgat gcatccgcgg ttgaagtgac cacgtcagat 480 

cagcctgaac tctatgccga tatcattgga ggcctagcgt acacaatggg tggtcgctgc 540 

agcgtaggat ttgcagccac aaatgcagct ggacaacctg gcttcgtgac agctggacat 600 

tgcggccgcg tcggtacaca ggttactatc ggcaatggaa gaggtgtctt tgagcaaagc 660 

gtatttcccg ggaatgatgc tgccttcgtt agaggtacgt ccaactttac gcttactaac 720 

ttagtatcta gatacaacac tggcggatat gcaactgtag caggtcacaa tcaagcacct 780 

attggctcta gcgtctgccg ctcagggtcg actacaggat ggcattgtgg aaccattcaa 840 

gctagaggtc agagcgtgag ctatcctgaa ggtaccgtaa cgaacatgac tcgtacgact 900 

gtatgtgcag aaccaggtga ctctggaggt tcatatatca gcggtacgca agcgcaaggc 960 

gttacctcag gtggatccgg taactgtagg acaggtggca caacgttcta ccaggaagtg 1020 

acaccgatgg tgaactcttg gggagttaga ctccgtacat aa 1062 

<210> 2 
<211> 1143 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> -lfi^fSld bJ%?r?n C f?S m rJ; 1 S o e ^ 0d1 '? 9 a s ?2 Please denoted 
wmiDnrS S ? y u r R n P tr ame to the signal peptide encodinq 
sequence of a heterologous protease, Savinase. encoain 9 

<400> 2 

atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 60 

Page 1 



agttcatcga tcgcatcggc t.cu^sS&^^S^c acctgaagca 
gatgcagtat cgatgcaaga agcattacaa cgtgatcttg atcttacatc agctgaagct 
gaggaattac ttgctgcaca agatacagcc tttgaagttg atgaagctgc cgctgaagca 
gctggtgatg catatggtgg ttcagtattc gatactgaat cactcgaact tactgtacta 
gtgaccgatg cagcagctgt tgaagctgtt gaagccacag gtgcaggtac agagctcgta 
tcttatggta ttgatggatt agatgagatc gtacaagagc ttaatgcagc tgatgccgtt 
ccaggtgtag ttggatggta tcctgatgta gcaggtgata ctgttgtctt agaagttctt 
gaaggctctg gagctgatgt ttctggactt ttagcagacg caggagtcga tgcatccgcg 
gttgaagtga ccacgtcaga tcagcctgaa ctctatgccg atatcattgg aggcctagcg 
tacacaatgg gtggtcgctg cagcgtagga tttgcagcca caaatgcagc tggacaacct 
ggcttcgtga cagctggaca ttgcggccgc gtcggtacac aggttactat cggcaatgga 
agaggtgtct ttgagcaaag cgtatttccc gggaatgatg ctgccttcgt tagaggtacg 
tccaacttta cgcttactaa cttagtatct agatacaaca ctggcggata tgcaactgta 
gcaggtcaca atcaagcacc tattggctct agcgtctgcc gctcagggtc gactacagga 
tggcattgtg gaaccattca agctagaggt cagagcgtga gctatcctga aggtaccgta 
acgaacatga ctcgtacgac tgtatgtgca gaaccaggtg actctggagg ttcatatatc 
agcggtacgc aagcgcaagg cgttacctca ggtggatccg gtaactgtag gacaggtggc 
acaacgttct accaggaagt gacaccgatg gtgaactctt ggggagttag actccgtaca 1140 

1143 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



<210> 3 

<211> 8 

<212> prt 

<213> Artificial sequence 
<220> 



<223> C-terminal amino acid tail exDres<;f»H »c -f..^~„ * 

invention. expressed as fusion to protease of the 

<400> 3 

Gin ser His val Gin ser Ala Pro 



<210> 4 
<211> 24 
<212> dna 

<213> Artificial sequence 
<220> 

Polynucleotide encoding a < 

invention. 



<400> 4 

caatcgcatg ttcaatccgc tcca 

24 

<210> 5 

Page 2 



<211> 4 10423.000-DK.ST25.txt 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> inllSiSn? amin ° aCid tai1 ex P ressed as ^sion to protease of the 

<400> 5 

Gin ser Ala Pro 

<210> 6 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> ?SM°rrSe»?hl - 1 — rtd » 

<400> 6 
caatcggctc ct 



12 



<210> 7 

<211> 2 

<212> prt 

<213> Artificial sequence 
<220> 



<223> inlStiSS] acid tail ex P ressed a * ^sion to protease of the 

<400> 7 



Gin Pro 
1 



<210> 8 

<211> 6 

<212> DNA 

<213> Artificial sequence 
<220> 



<400> 8 
caacca 



<210> 9 

<211> 1 

<212> prt 

<213> Artificial sequence 
<220> 



<223> inleSiSS] amin ° aCid tai1 jessed as fusion to protease of the 



<400> 9 
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Pro 
1 



1042 3 . 000-DK . ST2 5 . txt 



<210> 10 
<211> 3 
<212> DNA 

<213> Artificial sequence 
<220> 

<22J> s^rr^^^ sssass «*■ tan e^s*, as 

<400> 10 
cca 

3 

<210> 11 
<211> 45 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #252639 
<400> U 

catgtgcatg tgggtaccgc aacgttcgca gatgctgctg aagag 45 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #251992 

<400> 12 

catgtgcatg tggtcgaccg attatggagc ggattgaaca tgcg 44 

<210> 13 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179541 

<400> 13 

gcgttgagac gcgcggccgc gagcgccgtt tggctgaatg atac 44 

<210> 14 
<213> 43 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer #179542 
<400> 14 

gcgttgagac agctcgagca gggaaaaatg gaaccgcttt ttc 

43 

<210> 15 
<211> 64 
<212> DNA 

<213> Artificial sequence 
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<22 0> 10423 • OOO-DK. ST25 . txt 

<223> Primer #179539 
<400> 15 

ccatttgatc agaattcact ggccgtcgtt ttacaaccat tgcggaaaat agtcataggc 
atcc 



<210> 16 

<211> 60 

<212> dna 

<213> Artificial sequence 
<220> 

<223> Primer #179540 

<400> 16 

ggatccagat ctggtacccg ggtctagagt cgacgcggcg gttcgcgtcc ggacagcaca 

<210> 17 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179154 

<400> 17 

gttgtaaaac gacggccagt gaattctgat caaatgg 

<210> 18 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179153 

<400> 18 

ccgcgtcgac actagacacg ggtacctgat ctagatc 

<210> 19 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #317 

<400> 19 

tggcgcaatc ggtaccatgg gg 

<210> 20 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #139 NotI 
<400> 20 

catgtgcatg cggccgcatt aacgcgttgc cgcttctgcg 

Page 5 



10423 . 000-DK.ST25 . txt 

<210> 21 
<211> 7443 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of plasmid pMB1508 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgataaaagt gctttttttg 420 

ttgcaattga agaattatta atgttaagct taattaaaga taatatcttt gaattgtaac 480 

gcccctcaaa agtaagaact acaaaaaaag aatacgttat atagaaatat gtttgaacct 540 

tcttcagatt acaaatatat tcggacggac tctacctcaa atgcttatct aactatagaa 600 

tgacatacaa gcacaacctt gaaaatttga aaatataact accaatgaac ttgttcatgt 660 

gaattatcgc tgtatttaat tttctcaatt caatatataa tatgccaata cattgttaca 720 

agtagaaatt aagacaccct tgatagcctt actataccta acatgatgta gtattaaatg 780 

aatatgtaaa tatatttatg ataagaagcg acttatttat aatcattaca tatttttcta 840 

ttggaatgat taagattcca atagaatagt gtataaatta tttatcttga aaggagggat 900 

gcctaaaaac gaagaacatt aaaaacatat atttgcaccg tctaatggat ttatgaaaaa 960 

tcattttatc agtttgaaaa ttatgtatta tggagctctg aaaaaaagga gaggataaag 1020 

aatgaagaaa ccgttgggga aaattgtcgc aagcaccgca ctactcattt ctgttgcttt 1080 

tagttcatcg atcgcatcgg ctgctgaaga agcaaaagaa aaatatttaa ttggctttaa 1140 

tgagcaggaa gctgtcagtg agtttgtaga acaagtagag gcaaatgacg aggtcgccat 1200 

tctctctgag gaagaggaag tcgaaattga attgcttcat gaatttgaaa cgattcctgt 1260 

tttatccgtt gagttaagcc cagaagatgt ggacgcgctt gaactcgatc cagcgatttc 1320 

ttatattgaa gaggatgcag aagtaacgac aatggcgcaa tcggtaccat ggggtatatc 1380 

aacgcgttaa tccgcggata tatagcggcc gcagatctgg gaccaataat aatgactaga 1440 

gaagaaagaa tgaagattgt tcatgaaatt aaggaacgaa tattggataa agtgggatat 1500 

ttttaaaata tatatttatg ttacagtaat attgactttt aaaaaaggat tgattctaat 1560 

gaagaaagca gacaagtaag cctcctaaat tcactttaga taaaaattta ggaggcatat 1620 

caaatgaact ttaataaaat tgatttagac aattggaaga gaaaagagat atttaatcat 1680 

tatttgaacc aacaaacgac ttttagtata accacagaaa ttgatattag tgttttatac 1740 
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cgaaacataa aacaagaagg atataaattt t.38& S EL££ ett agtgacaagg 1800 

gtgataaact caaatacagc ttttagaact ggttacaata gcgacggaga gttaggtrtat i860 

tgggataagt tagagccact ttatacaatt tttgatggtg tatctaaaac attctctggt 1920 

atttggactc ctgtaaagaa tgacttcaaa gagttttatg atttatacct ttctgatgta 1980 

gagaaatata atggttcggg gaaattgttt cccaaaacac ctatacctga aaatgctttt 2040 

tctctttcta ttattccatg gacttcattt actgggttta acttaaatat caataataat 2100 

agtaattacc ttctacccat tattacagca ggaaaattca ttaataaagg taattcaata 2160 

tatttaccgc tatctttaca ggtacatcat tctgtttgtg atggttatca tgcaggattg 2220 

tttatgaact ctattcagga attgtcagat aggcctaatg actggctttt ataatatgag 2280 

ataatgccga ctgtactttt tacagtcggt tttctaacga tacattaata ggtacgaaaa 2340 

agcaactttt tttgcgctta aaaccagtca taccaataac ttaagggtaa ctagcctcgc 2400 

cggaaagagc gaaaatgcct cacatttgtg ccacctaaaa aggagcgatt tacatatgag 2460 

ttatgcagtt tgtagaatgc aaaaagtgaa atcagctgga ctaaaagggg ccgcagagta 2520 

gaatggaaaa ggggatcgga aaacaagtat ataggaggag acctatttat ggcttcagaa 2580 

aaagacgcag gaaaacagtc agcagtaaag cttgttccat tgcttattac tgtcgctgtg 2640 

ggactaatca tctggtttat tcccgctccg tccggacttg aacctaaagc ttggcatttg 2700 

tttgcgattt ttgtcgcaac aattatcggc. tttatctcca agcccttgcc aatgggtgca 2760 

attgcaattt ttgcattggc ggttactgca ctaactggaa cactatcaat tgaggataca 2820 

ttaagcggat tcgggaataa gaccatttgg cttatcgtta tcgcattctt tatttcccgg 2880 

ggatttatca aaaccggtct cggtgcgaga atttcgtatg tattcgttca gaaattcgga 2940 

aaaaaaaccc ttggactttc ttattcactg ctattcagtg atttaatact ttcacctgct 3000 

attccaagta atacggcgcg tgcaggaggc attatatttc ctattatcag atcattatcc 3060 

gaaacattcg gatcaagccc ggcaaatgga acagagagaa aaatcggtgc attcttatta 3120 

aaaaccggtt ttcaggggaa tctgatcaca tctgctatgt tcctgacagc gatggcggcg 3180 

aacccgctga ttgccaagct ggcccatgat gtcgcagggg tggacttaac atggacaagc 3240 

tgggcaattg ccgcgattgt accgggactt gtaagcttaa tcatcacgcc gcttgtgatt 3300 

tacaaactgt atccgccgga aatcaaagaa acaccggatg cggcgaaaat cgcaacagaa 3360 

aaactgaaag aaatgggacc gttcaaaaaa tcggagcttt ccatggttat cgtgtttctt 3420 

ttggtgcttg tgctgtggat ttttggcggc agcttcaaca tcgacgctac cacaaccgca 3480 

ttgatcggtt tggccgttct cttattatca caagttctga cttgggatga tatcaagaaa 3540 

gaacagggcg cttgggatac gctcacttgg tttgcggcgc ttgtcatgct cgccaacttc 3600 

ttgaatgaat taggcatggt gtcttggttc agtaatgcca tgaaatcatc cgtatcaggg 3660 

ttctcttgga ttgtggcatt catcatttta attgttgtgt attattactc tcactatttc 3720 

tttgcaagtg cgacagccca catcagtgcg atgtattcag catttttggc tgtcgtcgtg 3780 
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gcagcgggcg caccsccact ttta^Ub^S^SU caacctgrtc 
gggtcaacga ctcactacgg ttct 99 a 9 c 9 g«cc 99 tct tcttcggagc aggctacatc 
ccgcaaggca aat 9g t 99 tc catcggattt atcctgtcga ttgttcatat catcgtatgg 
cttgtgatcg 9C99 attat 9 gtggaaagta cta 99 aatat 99 ta 9 aaa 9 a aaaaggcaga 
c 9 c 99 tct 9 c ctttttttat rttcactcct tcgtaagaaa at 99 atttt 9 aaaaatgaga 
aaatttcctg t 9 aaaaat 99 tatgatctag 9 ta 9 aaa 99 a cggetggtgc tgtggtgaaa 
aa 9 c 99 ttcc atttttccct gcaaacaaaa ataat 9999 c tgattgcggc tctgctggtc 
"tgtcattg gtgtgctgac cattacgtt, gccgttcagc atacacaggg agaacggaga 
caggcagagc agctggcggt tcaaacggcg agaaccattt cctatatgcc gccggttaaa 
gagctcattg agagaaaaga cggacatgcg gctcagacgc aagaggtcat tgaacaaatg 
aaagaacaga ctggtgcgtt tgccatttat gttttgaacg aaaaaggaga cattcgcagc 
gcctctggaa aaagcggatt aaagaaactg gagcgcagca gagaaatttt gtttggcggt 
tcgcatgttt ctgaaacaaa agcggatgga cgaagagtg, tcagagggag cgcgccgatt 
ataaaagaac agaagggata cagccaagtg atcggcagcg tgtctgttga ttttctgcaa 
acggagacag ageaaagcat caaaaagcat ttgagaaatt tgagtgtgat tgctgtgctt 
gtactgctgc tcggatttat tggcgccgcc gtgctggcga aaagcatcag aaaggatacg 
ctcgggcttg aaccgcatga gatcgcggct ctatatcgtg agaggaacgc aatgcttttc 
gcgattcgag aagggattat tgccaccaat cgtgaaggcg tcgtcaccat gatgaacgta 
tcggcggccg agatgctgaa gctgcccgag cctgtgatcc atcttcctat agatgacgtc 
atgccgggag cagggctgat gtctgtgctt gaaaaaggag aaatgctgcc flaaccaggaa 
Staagcgtca acgatcaagt gtttattatc aatacgaaag tgatgaatca aggcgggcag 
gcgtatggga ttgtcgtcag c«cagggag aaaacagagc tgaagaagct gatcgacaca 
ttgacagagg ttcgcaaata ttcagaggat ctcagggcgc agactcatg, attttcaaat 
aagctttatg cgattttagg gctgcgtcga cctgcaggca tgcaagcttg gcg ta atca t 
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 
tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 
cccctgacg, gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 
tataaagata ccaggcgttt ccccctggaa gctccc«gr gcgctctcct gttccgaccc 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
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octcacarta tanafai-riw- aB ti. 10423.000-DK.ST25.txt 

gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc S880 

acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 5940 

acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 6000 

cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 6060 

gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 6120 

gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 6180 

agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 6240 

ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 6300 

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 6360 

atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 6420 

tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 6480 

gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 6540 

caccggattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 6600 

caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 6660 

cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 6720 

cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 6780 

cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 6840 

agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 6900 

tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 6960 

agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac 7020 

atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa 7080 

ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt 7140 

cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg 7200 

caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat " 7260 

attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt 7320 

agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct 7380 

aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc 7440 
gtc 

7443 

<210> 22 
<2U> 5718 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of MB15 10 genomic integration region 
<400> 22 

gagcgccgtt tggctgaatg atacaacagt ctcacttcct tactgcgtct ggttgcaaaa 60 
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acgaagaagc 
gcacatcttt 
ggtccgcatc 
ttttccactc 
cgattttgct 
gtgtaaagcc 
aggaatccgc 
cttttagatc 
tatttaatgc 
ggctttttaa 
gctcagcgtc 
cgctgcgtgt 
cttttttgtt 
agaatgaaag 
cttcgtcatg 
ggcttgatga 
tcaacttgtt 
tcattatcac 
gaaagctgcg 
cagcgttacc 
tatcagcagt 
catctgttag 
tatcgatatg 
cggcacgctt 
gttctggtga 
tcatggtcgt 
cccgttccta 
ttctcttcaa 
gaaaacatga 
acagctcatg 
cgtcttagct 
gatcggagcc 
agacgagctc 
tgaaaaacgc 



aaggattccc 
ggcgcttgtt 
atagacttgt 
atctacaatg 
caattgctca 
tagcatttta 
tcttgcaacg 
gttgatgact 
tttagcaatg 
ttcaatcgtc 
agcgatgatt 
gcctacagat 
gtcatctgcg 
ctcaagcaca 
gatcactctg 
acagccggat 
agaataggtg 
atgtaacact 
tttttactgc 
gtgtcttcga 
gacataagga 
tgcaaattca 
aaaatgttcc 
atggaaaatc 
aaagtattgc 
gtgctccgtg 
tcactttacc 
tgttctggaa 
aggaggaata 
aaagatgtag 
cacgacgatg 
aaagaaacag 
cgcaataaaa 
ttragatgaag 



rtcnr^ 1° 423 - OOO-DK. ST25 . tXt 

ctcgcttctc atttgtccta tttattatac acttttttaa 



120 

tcactragact tgatgcctct gaatcttgtc caagtgtcac 180 
ccattttrtca ccgctttgag atttttccag agcgggttcg 240 
gttttgcctt cgttggctga gatgaacaaa atatcaggat 300 
aggctgacct cttgataggc gttatctgac ttcacagcgt 360 
aagatttctc cgtcatagga tgatgatgta tgaagctgga 420 
ccgagaacga tgttgcggtt ttcatctttc ggaagttcgg 480 
tttttgtgct cggcaagctt ttcttttcct tcatcttctt 54 0 
gtcgtaaagc tgtcgatcgt ttcgtcatat gtcgcttcac 600 
ggggcgattt ttttcagctg tttataaatg tttttatggc 660 
aaatcaggct tcaaggaact gatgacctca agattgggtt 720 
gtgtaatcaa tggagctgcc gacaagcttt ttaatcatat 780 
atgcccaccg gcgtaatgcc gagattgtga acggcatcca 840 
accacccgct taggtgtgcc gcttactgtc gtttttcctt 900 
gaatccttag actcgctttt gccgcttccg ttgttattct 960 
acaatgaggc aggcgagcaa taaaacactc atgatggcaa 1020 
cgcatgtcat tcttcctttt ttcagattta gtaatgagaa 1080 
ataatagcat ggcttatcat gtcaatattt ttttagtaaa 1140 
tttctcatga aagcatcatc agacacaaat aagtggtatg 1200 
gacaaaaacg catgggcgtt ggctttagag gtttcgaaca 1260 
aggagagtgc tgagataacc ggacaatttc ttttctattt 1320 
atgtcgccga tattcatgat aatcgagaaa acaaagtcga 1380 
tcggcaaaaa ccgcaagctc gtgaattcct ggtgaacatc 1440 
tgtttgacta aatcactcac aatccaagca ttgtattgct 1500 
attagacata cctcctgctc gtacggataa aggcagcgtt 1560 
cagcggcttc tccttaattt tgatttttct gaaaataggt 1620 
atggacggaa aacaaatagc tactaccatt cctcctgttt 1680 
tctgtttcag gtacagacga tcgggtatga aagaaatata 1740 
tcgacatgaa accagttgta aaagagtata caaatgacga 1800 
aggaattgca gaaaatgggt gttgcgaaag aggatgtata 1860 
acagaacgga acgcctggct gacaacacga acgccaacac 1920 
gtttcaagca cgcggtggga aatatcttca ataaaaaagg 1980 
«cacgaaat cggtttttct gaagatgaag ccgctcaatt 2040 
gaaaagtgct tctctttgtg acagataacg aaaaagtgaa 2100 
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«— - ~-^sasi*giiSUt« 99 « t «« 8 2160 

cggtctttgc ggtgggattt tgcagaatgc cgcaatagga tagcggaaca ttrtcggttc 2220 

tgaatgtccc tcaattrgct attatatttt tgtgataaat tggaataaaa tctcacaaaa 22S0 

tagaaaatgg gggtacatag tggatgaaaa aagtgatgtt agctacggct ttgttttta, 2340 

gattgactcc agctggcgcg aacgcagctg atrtaggcca ccagacgttg ggatccaatg 2400 

atggctgggg cgcgtactcg accggcacga caggcggatc aaaagcatcc tcctcaaatg 2460 

tgtataccgt cagcaacaga aaccagcttg tctcggcatt agggaaggaa acgaacacaa 2S20 

cgccaaaaat catttatatc aagggaacga ttgacatgaa cgtggatgac aatctgaagc 2580 

cgcttggcct aaatgactat aaagatccgg agtatgattt ggacaaatat ttgaaagcct 2640 

atgatcctag cacatggggc aaaaaagagc cgtcgggaac acaagaagaa gcgagagcac 2700 
gctctcagaa aaaccaaaaa gcacgggtca tggtggatat ccctgcaaac acgacgatcg 



2640 

2700 

tcggttcag, gactaacgct aaagtcgtgg gaggaaactt ccaaatcaag agtgataacg 2820 

tcattattcg caacattgaa ttccaggatg cctatgacta ttttccgcaa tggttgtaaa 2880 
acaacoorra nt fl «f^ ^ . . 



acgacggcca gtgaattctg atcaaatggt tcagtgagag cgaagcgaac acttgatttt 
ttaattttct atcttttata ggtcattaga gtatacttat ttgtcctata aactatttag 
cagcataata gatttattga ataggtcatt taagttgagc atattagagg aggaaaatct 
tggagaaata tttgaagaac ccgagatcta gatcaggtac cgcaacgttc gcagatgctg 
ctgaagagat tattaaaaag ctgaaagcaa aaggctatca attggtaact gtatctcagc 
ttgaagaagt gaagaagcag agaggctatt gaataaatga gtagaaagcg ccatatcggc 
gcttttcttt tggaagaaaa tatagggaaa atggtacttg ttaaaaattc ggaatattta 
tacaatatca tatgtatcac attgaaagga ggggcctgct gtccagactg tccgctgtgt 
aaaaataagg aataaagggg ggttgacatt attttactga tatgtataat ataatttgta 
taagaaaatg gaggggccct cgaaacgtaa gatgaaacct tagataaaag tgcttttttt 
Ottgcaattg aagaattatt aatgttaagc ttaattaaag ataatatctt tgaattgtaa 
cgcccctcaa aagtaagaac tacaaaaaaa gaatacgtta tatagaaata tgtttgaacc 
ttcttcagat tacaaatata ttcggacgga ctctacctca aatgcttatc taactataga 
atgacataca agcacaacct tgaaaatttg aaaatataac taccaatgaa cttgttcatg 
tgaattatcg ctgtatttaa ttttctcaat tcaatatata atatgccaat acattgttac 3780 
aagtagaaat taagacaccc ttgatagcct tactatacct aacatgatgt agtattaaat 
gaatatgtaa atatatttat gataagaagc gacttattta taatcattac atatttttct 
attggaatga ttaagattcc aatagaatag tgtataaatt atttatcttg aaaggaggga 
tgcctaaaaa cgaagaacat taaaaacata tatttgcacc gtctaatgga tttatgaaaa 
atcattttat cagtttgaaa attatgtatt atggagctct gaaaaaaagg agaggataaa 
gagaaaaggg gatcggaaaa caagtatata ggaggagacc tatttatggc ttcagaaaaa 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



3840 
3900 
3960 
4020 
4080 
4140 
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gacgcaggaa aacagtcagc cgctgtggga 4200 

ctaatcatct ggtttattcc cgctccgtcc ggacttgaac ctaaagcttg gcatttgttt 4260 

gcgatttttg tcgcaacaat tatcggcttt atctccaagc ccttgccaat gggtgcaatt 4320 

gcaatttttg cattggcggt tactgcacta actggaacac tatcaattga ggatacatta 4380 

agcggattcg ggaataagac catttggctt atcgttatcg cattctttat ttcccgggga 4440 

tttatcaaaa ccggtctcgg tgcgagaatt tcgtatgtat tcgttcagaa attcggaaaa 4500 

aaaacccttg gactttctta ttcactgcta ttcagtgatt taatactttc acctgctatt 4560 

ccaagtaata cggcgcgtgc aggaggcatt atatttccta ttatcagatc attatccgaa 4620 

acattcggat caagcccggc aaatggaaca gagagaaaaa tcggtgcatt cttattaaaa 4680 

accggttttc aggggaatct gatcacatct gctatgttcc tgacagcgat ggcggcgaac 4740 

ccgctgattg ccaagctggc ccatgatgtc gcaggggtgg acttaacatg gacaagctgg 4800 

gcaattgccg cgattgtacc gggacttgta agcttaatca tcacgccgct tgtgatttac 4860 

aaactgtatc cgccggaaat caaagaaaca ccggatgcgg cgaaaatcgc aacagaaaaa 4920 

ctgaaagaaa tgggaccgtt caaaaaatcg gagctttcca tggttatcgt gtttcttttg 4980 

gtgcttgtgc tgtggatttt tggcggcagc ttcaacatcg acgctaccac aaccgcattg 5040 

atcggtttgg ccgttctctt attatcacaa gttctgactt gggatgatat caagaaagaa 5100 

cagggcgctt gggatacgct cacttggttt gcggcgcttg tcatgctcgc caacttcttg 5160 

aatgaattag gcatggtgtc ttggttcagt aatgccatga aatcatccgt atcagggttc 5220 

tcttggattg tggcattcat cattttaatt gttgtgtatt attactctca ctatttcttt 5280 

gcaagtgcga cagcccacat cagtgcgatg tattcagcat ttttggctgt cgtcgtggca 5340 

gcgggcgcac cgccgctttt agcagcgctg agcctcgcgt tcatcagcaa cctgttcggg 5400 

tcaacgactc actacggttc tggagcggct ccggtcttct tcggagcagg ctacatcccg 5460 

caaggcaaat ggtggtccat cggatttatc ctgtcgattg ttcatatcat cgtatggctt 5520 

gtgatcggcg gattatggtg gaaagtacta ggaatatggt agaaagaaaa aggcagacgc 5580 

ggtctgcctt tttttatttt cactccttcg taagaaaatg gattttgaaa aatgagaaaa 5640 

ttccctgtga aaaatggtat gatctaggta gaaaggacgg ctggtgctgt ggtgaaaaag 5700 
cggttccatt tttccctg 



<210> 23 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1605 
<400> 23 

9acggccagt gaattcgata aaagtgc 
<210> 24 
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5718 



27 



<211> 42 10423.O00-DK.ST25.txt 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 1606 
<220> 

<221> misc_feature 

<||f> C13)..(13) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (16).. (16) 

<223> n is a, c, g, or t 

<400> 24 

ccagatctct atnktnktgt acggagtcta actccccaag ag 42 

<210> .25 
<211> 1112 
<212> DNA 

<213> Nocardiopsis dassonvillei dsm 43235 
<400> 25 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccagac ccccgtcgcc 60 

gacgacagcg ccgccagcat gaccgaggcg ctcaagcgcg acctcgacct cacctcggcc 120 

gaggccgagg agcttctctc ggcgcaggaa gccgccatcg agaccgacgc cgaggccacc 180 

gaggccgcgg gcgaggccta cggcggctca ctgttcgaca ccgagaccct cgaactcacc 240 

gtgctggtca ccgacgcctc cgccgtcgag gcggtcgagg ccaccggagc ccaggccacc 300 

gtcgtctccc acggcaccga gggcctgacc gaggtcgtgg aggacctcaa cggcgccgag 360 

gttcccgaga gcgtcctcgg ctggtacccg gacgtggaga gcgacaccgt cgtggtcgag 420 

gtgctggagg gctccgacgc cgacgtcgcc gccctgctcg ccgacgccgg tgtggactcc 480 

tcctcggtcc gggtggagga ggccgaggag gccccgcagg tctacgccga catcatcggc 540 

ggcctggcct actacatggg cggccgctgc tccgtcggct tcgccgcgac caacagcgcc 600 

ggtcagcccg gtttcgtcac cgccggccac tgcggcaccg tcggcaccgg cgtgaccatc 660 

ggcaacggca ccggcacctt ccagaactcg gtcttccccg gcaacgacgc cgccttcgtc 720 

cgcggcacct ccaacttcac cctgaccaac ctggtctcgc gctacaactc cggcggctac 780 

cagtcggtga ccggtaccag ccaggccccg gccggctcgg ccgtgtgccg ctccggctcc 840 

accaccggct ggcactgcgg caccatccag gcccgcaacc agaccgtgcg ctacccgcag 900 

ggcaccgtct actcgctcac ccgcaccaac gtgtgcgccg agcccggcga ctccggcggt 960 

tcgttcatct ccggctcgca ggcccagggc gtcacctccg gcggctccgg caactgctcc 1020 

gtcggcggca cgacctacta ccaggaggtc accccgatga tcaactcctg gggtgtcagg 1080 

atccggacct aatcgcatgt tcaatccgct cc 1112 

<210> 26 
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48 10423.OOO-DK.ST25.txt 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1423 
<400> 26 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccag 48 

<210> 27 

<211> 45 

<212> dna 

<213> Artificial sequence 
<220> 

<223> Primer 1475 

<400> 27 

ggagcggatt gaacatgcga ttaggtccgg atcctgacac cccag 45 

<210> 28 
<211> 354 
<212> prt 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 28 

Ala Pro Ala Pro val Pro Gin xhr Pro val Ala Asp Asp Ser Ala Ala 
3 10 15 

ser wet Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser Ala Glu 
* Q 25 30 

Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala He Glu Thr Asp Ala 

Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly Ser Leu Phe Asp 

Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp Ala Ser Ala val 

/u 75 80 

Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr val val ser His Gly 
o> 90 95 ' 

Thr Glu Gly Leu Thr Glu val val Glu Asp Leu Asn Gly Ala Glu val 
AUU 105 110 

Pro Glu ser val Leu Gly Trp Tyr Pro Asp val Glu ser Asp Thr Val 

3 2 5 



val val Glu val Leu Glu Gly Ser Asp Ala Asp val Ala Ala Leu Leu 

Ala asp Ala Gly val Asg ser ser Ser val Arg val Glu Glu Ala Glu 

5U 155 160 



160 
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10423 . 000-DK . ST2 5 . txt 

Glu Ala Pro Gin yal Tyr Ala Asp He He Gly Gly Leu Ala Tyr Tyr 
. 165 170 175 

wet Gly Gly Arg Cys ser Val Gly Phe Ala Ala Thr Asn ser Ala Gly 
■ L0U 185 190 

Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Thr Val Gly Thr Gly 
x " 200 205 

val Thr He Gly Asn Gly Thr Gly Thr Phe Gin Asn ser val Phe Pro 

^JL5 220 

Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn Phe Thr Leu Thr 

" u 235 240 

Asn Leu val ser Arq Tyr Asn Ser Gly Gly Tyr Gin Ser Val Thr Gly 

2 S 5 

Thr ser Gin Ala Pro Ala Gly ser Ala val cys Arg ser Gly ser Thr 
* ou 265 270 

Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Asn Gin Thr val Arg 

280 285 

Tyr Pro Gin Gly Thr val Tgr ser Leu Thr Arg Thr Asn val cys Ala 

Glu Pro Gly Asp ser Gly Gly ser Phe lie ser Gly ser Gin Ala Gin 

3J - U 315 320 

Gly val Thr Ser Gly Gly Ser Gly Asn Cys ser Val Gly Gly Thr Thr 
3 " 330 ' 335 

Tyr Tyr Gin Glu val Thr Pro Met lie Asn Ser Trp Gly val Arg lie 
^ u 345 350 7 

Arg Thr 

<210> 29 
<211> 498 
<212> DNA 

<213> Nocardiopsis dassonvillei dsm 43235 
<400> 29 

gctccggccc ccgtccccca gacccccgtc gccgacgaca gcgccgccag catgaccgag 60 
gcgctcaagc gcgacctcga cctcacctcg gccgaggccg aggagcttct ctcggcgcag 120 
gaagccgcca tcgagaccga cgccgaggcc accgaggccg cgggcgaggc ctacggcggc 180 
tcactgttcg acaccgagac cctcgaactc accgtgctgg tcaccgacgc ctccgccgtc 240 
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10423.000-DK.ST25.txt 

gaggcggtcg aggccaccgg agcccaggcc accgtcgtct cccacggcac cgagggcctg 300 

accgaggtcg tggaggacct caacggcgcc gaggttcccg agagcgtcct cggctggtac 360 

ccggacgtgg agagcgacac cgtcgtggtc gaggtgctgg agggctccga cgccgacgtc 420 

gccgccctgc tcgccgacgc cggtgtggac tcctcctcgg tccgggtgga ggaggccgag 480 

gaggccccgc aggtctac 498 

<210> 30 

<2U> 166 

<212> prt 

<213> Nocardiopsis dassonvillei dsm 43235 

<400> 30 

Ala Pro Ala pro yal Pro Gin Thr Pro Val Ala Asp Asp ser Ala Ala 
x 5 10 15 

Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr ser Ala Glu 
20 25 30 

Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu Thr Asp Ala 
35 40 45 

Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly ser Leu Phe Asp 
5U 55 60 

Thr Glu Thr Leu Glu Leu Thr Val Leu val Thr Asp Ala Ser Ala Val 
65 70 75 80 

Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr val val Ser His Gly 
85 90 95 

Thr Glu Gly Leu Thr Glu val val Glu Asp Leu Asn Gly Ala Glu val 
100 105 110 

Pro Glu ser val Leu Gly Trp Tyr pro Asp val Glu Ser Asp Thr val 
115 120 125 

val val Glu val Leu Glu Gly ser Asp Ala Asp val Ala Ala Leu Leu 
- L3 *' 135 140 

Ala Asp Ala Gly Val Asp ser ser ser val Arg val Glu Glu Ala Glu 

15° 155 160 

Glu Ala Pro Gin val Tyr 
165 

<210> 31 
<211> H46 
<212> DNA 

<213> Artificial sequence 
<220> 
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construct: 10R(proA19l8L2)7 var >ant encoding gene; whole 



<400> 31 



atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 

agttcatcga tcgcatcggc tgctccggcc cccgtccccc agacccccgt cgccgacgac 

agcgccgcca gcatgaccga ggcgctcaag cgcgacctcg acctcacctc ggccgaggcc 

gaggagcttc tctcggcgca ggaagccgcc atcgagaccg acgccgaggc caccgaggcc 

gcgggcgagg cctacggcgg ctcactgttc gacaccgaga ccctcgaact caccgtgctg 

gtcaccgacg cctccgccgt cgaggcggtc gaggccaccg gagcccaggc caccgtcgtc 

tcccacggca ccgagggcct gaccgaggtc gtggaggacc tcaacggcgc cgaggttccc 

gagagcgtcc tcggctggta cccggacgtg gagagcgaca ccgtcgtggt cgaggtgctg 

9 agggctccg acgccgacgt cgccgccctg ctcgccgacg ccggtgtgga ctcctcctcg 

9tccgggtgg aggaggccga ggaggccccg caggtctatg ccgatatcat tggaggccta 

gcgtacacaa tgggtggtcg ctgcagcgta ggatttgcag ccacaaatgc agctggacaa 

cctggcttcg tgacagctgg acattgcggc cgcgtcggta cacaggttac tatcggcaat 

ggaagaggtg tctttgagca aagcgtattt cccgggaatg atgctgcctt cgttagaggt 

acgtccaact ttacgcttac taacttagta tctagataca acactggcgg atatgcaact 

gtagcaggtc acaatcaagc acctattggc tctagcgtct gccgctcagg gtcgactaca 

ggatggcatt gtggaaccat tcaagctaga ggtcagagcg tgagctatcc tgaaggtacc 

gtaacgaaca tgactcgtac gactgtatgt gcagaaccag gtgactctgg aggttcatat 

atcagcggta cgcaagcgca aggcgttacc tcaggtggat ccggtaactg taggacaggt 

ggcacaacgt tctaccagga agtgacaccg atggtgaact cttggggagt tagactccgt 
acataa y 



acataa 

<210> 32 

<211> 1068 

<212> DNA 

<213> Nocardiopsis Alba OSM 15647 

<400> 32 



Bcgaccggcc ccctccccca gtcccccacc ccggatgaag ccgaggccac caccatggtc 
gaggccctcc agcgcgacct cggcctg«c ccctttcagg ccgacgagct cctcgaggcg 
caggccgagt ccttcgagat cgacgaggcc gccaccgegg ccgcagccga ctcctacggc 
ggctccatct tcgacaccga cagcctcacc ctgaccgtcc tggtcaccga cgcctccgcc 
gtcgaggcgg tcgaggccgc cggcgccgag gccaaggtgg tctcgcacgg catggagggc 
ctggaggaga tcgtcgccga cctgaacgcg gccgacgctc agcccggcgt cgtgggctgg 
taccccgaca tccactccga cacg 9tcgtc ctcgaggtcc tcgagggctc 
gtggactccc tgctcgccga cgccgg tgtg gacaccgccg acgtcaaggt 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1146 



60 
120 
180 
240 
300 
360 
420 
480 



acc 9 a g ca 9 c ccgagctgta catgggtggg 54Q 

cgctgctcgg tcggcttcgc , ggccaccaac gcctccggcc agcccgggtt cgtcaccgcc 600 

ggccactgcg gcaccgtcgg caccccggtc agcatcggca acggccaggg cgtcttcgag 660 

cgttccgtct tccccggcaa cgactccgcc ttcgtccgcg gcacctcgaa cttcaccctg 720 

accaacctgg tcagccgcta caacaccggt ggttacgcga ccgtctccgg ctcctcgcag 780 

gcggcgatcg gctcgcagat ctgccgttcc ggctccacca ccggctggca ctgcggcacc 840 

gtccaggccc gcggccagac ggtgagctac ccccagggca ccgtgcagaa cctgacccgc 900 

accaacgtct gcgccgagcc cggtgactcc ggcggctcct tcatctccgg cagccaggcc 960 
cagggcgtca cctccggtgg ctccggcaac tgctccttcg gtggcaccac ctactaccag 
gaggtcaacc cgatgctgag cagctggggt ctgaccctgc gcacctga 

<210> 33 
<211> 355 
<212> PRT 

<213> Nocardiopsis Alba dsm 15647 
<400> 33 

Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Asp Glu Ala Glu Ala 

10 15 

Thr Thr Met val Glu Ala Leu Gin Arg Asp Leu Gly Leu ser Pro ser 

25 30 

Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu Ser Phe Glu He Asp 



Glu Ala Ala Thr Ala Ala Ala Ala Asp ser Tyr Gly Gly ser He Phe 

J * 60 

Asp Thr asp ser Leu Thr Leu Thr val Leu Val Thr Asp Ala Ser Ala 

75 80 



val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val va! Ser „,s 

yu 95 

Gly »et Glu Gly Leu Glu G!u He val Ala Asp Leu As„ Ala Ala Asp 



110 



Ala Gin jro Gly val val Gly Jrp Tyr Pro Asp lie His ser Asp Thr 

val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp Ser Leu 

A " 140 

$ Ala Asp Ala Gly val Asp Thr Ala Asp Val Lys val Glu ser Thr 



1020 
1068 



155 



160 
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«. „„ Pro s „ Leu Tyr A «atts /a ». a t Leu 

170 175 

Thr Met G l y gj Arg ^ Ser va , ^ AU 

" LB5 190 
«y Gin jg G l y Phe val nr Aja Gly H(s cys Gly ^ w ^ 

& *" " e ** - «« «ly val Phe f l„ Arg ser va , ne 
go Gly As„ Asp ser AU Phe va, A r S Gly ? ^ As „ phe ^ 

240 

Thr asp Leu va! ser Ar fl Tyr A s„ Thr G , y 61y ^ Ala Thr „ a , sep 

250 255 
Gly ser ser g. Ala Ala n . 61y ^ ^ ^ ^ ^ ^ ^ ^ 

Thr Thr Gly Trp His cys Gly ^ Val „, A „ ^ ^ ^ ^ 

285 

ser Tyr Pro G l„ Gly Thr v? , „. ^ ^ ^ ^ ^ ^ ^ 

Ala olu Pro G , y Asp ser G l y Gly Ser phe , ser ^ ^ ^ 

315 320 
Gin Gly val Thr ser .1, Gly ser Gly ^ ^ ^ ^ ^ ^ ^ 

T*r Tyr Tyr f ,„ „. va , Asn Pro Met Leaser Ser Trp «j Leu Thr 



Leu Arg Thr 
355 

<210> 34 
<211> 43 



< 21|> DNA 
<213> Artificial sequence 
<220> 

<223> Primer 1421 
<400> 34 

9«catcgat cgcatcggct gcgaccgg CC ccctccccca gtc 

<210> 35 
<211> 3i 

<212> DNA 

<213> Artificial sequence 



Page 19 



1042 3 . 000-DK . ST25. txt 

<220> 

<223> Primer 1604 
<400> 35 

gcggatccta tcaggtgcgc agggtcagac c 



15 



ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Leu Glu 

" 30 

Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
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31 



<210> 36 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina dsm 15648 
<400> 36 

gccaccggac cgctccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgcggccg ccggggacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgacgc cgcctcggtc 240 

gaggctgtgg aggccaccgg cgcgggtacc gaactcgtct cctacggcat cgagggcctc 300 

gacgagatca tccaggatct caacgccgcc gacgccgtcc ccggcgtggt cggctggtac 360 

ccggacgtgg cgggtgacac cgtcgtcctg gaggtcctgg agggttccgg agccgacgtg 420 

agcggcctgc tcgccgacgc cggcgtggac gcctcggccg tcgaggtgac cagcagtgcg 480 

cagcccgagc tctacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggat tcgcggccac caacgccgcc ggtcagcccg gattcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgagcatc ggcaacggcc agggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggcacgt ccaacttcac gctgaccaac 720 

ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggccacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggct ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac ccggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 

acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 1062 

<210> 37 
<211> 353 
<212> PRT 

<213> Nocardiopsis prasina dsm 15648 
<400> 37 

Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala val 



10423.000-DK.ST25.txt 
Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe Asp 

Thr Glu Thr Leu Glu Leu Thr Val Leu val Thr Asp Ala Ala sen val 



75 



80 



Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu Leu Val ser Tyr Gly 

He Glu Gly Leu Asp Glu He lie Gin Asp Leu Asn Ala Ala Asp Ala 

val P ro Gly val val Gly Trp Tyr Pro Asp val Ala Gly Asp Thr val 

Val Leu Glu val Leu Glu Gly ser Gly Ala Asp val ser Gly Leu Leu 

D 140 

Ala Asp Ala Gly Val Asp Ala ser Ala val Glu Val Thr Ser ser Ala 

155 160 

Gin Pro Gl„ Le „ Tyr Ala « p IU I1e , Gly wu AU flir 

170 175 

Gly Arg cvs Ser val Gly Phe Ala Ala Thr Asn Ala Ala Gly Gin 

x " 190 

Pro Gly Phe val Thr Ala Gly His Cys Gly Arg val Gly Thr Gin 



205 



val 



ser lie Gly Asn Gly Gin Gly val Phe Glu Gin ser He Phe Pro Gly 

*o 220 7 

A f n Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe Thr Leu Thr Asn 

235 240 

Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr val Ala Gly His 

Asn Gin Ala Pro lie Gly ser ser val Cys Arg ser Gly Ser Thr Thr 

270 

Gly TrP T?l Gl " nr Ile SS A1 = *r 9 G!y Gin ser Val Ser Tyr 

285 

Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr Val cys Ala Glu 

300 

Pro Gly Asp Ser Gly Gly Ser Tyr He ser Gly AS n Gin Ala Gin Gly 

315 320 
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10423.000-DK.ST25.txt 
Val Thr ser Gly g ? Ser Gly Asn Cys ^ ^ ^ 



Arg rnr Gly Gly Thr Thr Phe 
330 335 



Tyr Glu val Thr P ro „« val Sef Tpp My ^ ^ ^ 



Thr 



<210> 38 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Priraer 1346 

£ <400> 38 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 

<210> 39 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1602 

<400> 39 

9cggatccta ttaggtccgg agacggacgc cccaggag 

<210> 40 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina DSM 15649 
<400> 40 

gccaccgac cactccccca gtcacccacc cc 99 a g!1 cc g acgccgtctc cat g ca 9gag 

P 9 c 9 acctc 99 , c« 9 acccc 9 c« 9 a 99 cc 9 at g aact 9 ct 99 cc 9 c«a 

9 ac cgcct tc9asgtc9a cgaggccgcg gcc9awcc9 ° 

tcc gt «tc 9 acacc g a g ac cc tg9 aac tg acc 9 tcc tgg tC ac cg actc c 9 cc 9cggtc 

2T t93 a " CCaCC " C9CC9MaCC "~-*« cac 999c « 

9 ac gagatcg tc g a 99 a g ct caac 9 cc g cc g ac g cc gtt c cc g9 c gtggt cgg c tg9t ac 
cc 99 ac 9 tc g c 999t9 acac csptcatgct, 9 a g9 tcct gg a^ttcc^ cgccgacgtg 
MC 99 cc tgc tc gccg ac g c c gg c 9 t gg ac 9 cctc 9g c 99 t Wsac caccaccgag 
ca 9 ccc 9 a 9 c tg „ cgccg a catcatc 99 c ggtctg9 cct acaccat g99 c 99ccgctgt 
« 9 tc gg « tc g c gg ccac caac 9 cc g cc ggt ca 9 ccc 9 9gttcgtcac cgccggt J c 
t g t gg cc 9 c 9 t gggc accca g9 t 9 accatc gg caac gg cc ggggcgtm cgagcagtcc 
«ct«cc gg 9 caac g ac 9 c c gC cttc 9 tc c g c 99 aac g t ccaac«cac g c tg aclc 
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43 



38 



60 
120 
180 
240 
300 
360 
420 

480 

540 

600 

660 

720 



ZZTl 9 " aCMCaC ^---^^ca, cca gg c g «c rso 

ir n cc9tct9cc9 — « M « Mcact9C9g caccatcca9 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccatca craaM *- 

y u 99 cacc 9tca ccaacatgac gcggaccacc 900 

g r a9ccc " c9a ctcc " c " c tcctacatct ~~ ■» 

ITT" caact9CC9C acc99C999a cc ™ cc ^- -020 

acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 



<210> 41 

<211> 353 

<212> prt 

<213> Nocardiopsis prasina dsm 15649 

<400> 41 



Jla Thr Gly Pro , eu pro Gl „ Ser ppo ^ ^ ^ 

10 15 

ser Met Gin Glu Ala Leo Gln Arg Leu Gly ^ ^ 

^ 5 30 
Al. Asp Glu Leu L eu Ala Ala Gl„ AS p Thr Ala P„e Glu val Asp Glu 

Ala Ala Ala Glu A„ Ala Gly Asp Ala Tyr Gly Gly Ser val Phe Asp 

60 

Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp ser Ala Ala Val 

75 80 
01" Ala val Glu Ala Thr Gly Ala Gly Tjr Glu Leu val ser Tyr Gly 

He Thr Gly Leu Asp Glu He val Glu Glu Leu As„ Ala Ala Asp Ala 

AU5 110 
val Pro Gly val Val Gly Trp Tyr Pro Asp va, Ala Gly Asp Thr val 

val Leu Glu val Leu Glu Gly ser S1 y Ala Asp Val Gly Gly Leu Leu 

Ala Asp Ala Gly va! Asp Ala ser Ala val Glu val Thr Thr Thr Glu 

155 160 
Cin Pro Glu Leu Tyr Ala Asp He Tie Glv Gly Leu Ala Tyr Thr Ket 

170 175 

=1/ Gly Ar 9 cys ser val Gly Pne Ala Ala Thr As„ Ala Ala Gly Glu 



190 
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1062 



Pro Gly phe val Thr Ala Gly H1« 0 Jil-?S 0 "? l< - ST25 »<t 

185 200 ^ S Gly Arg Va1 f 1 * «1n val 

Thr g. Gly «„ Gly Arg ?1? va , phe ^ ne ^ ^ 

J- Asp Ala Ala Phe V| l Arg Gly ^ sep phe wu 

235 240 
Lao val ser Arg Tyr Asn Thr Gly 6ly A „ Thr ^ ^ 

250 255 

Asn Gin Ala Pro He Gly ser ser Val cys Arg Ser Gly ser Thr Thr 

* b5 270 

Gly Trp His cys Gly Thr Ha Gl„ Ala Arg Gly Gin ser val Sar Tyr 

285 

Pro Gig Gly Thr Val Thr Asn Met T hr Arg Thr Thr val cys Ala Glu 

3 Pr ? Gly asp ser Gly Gly ser Tyr He ser Gly Asn G,„ Al, Gln Gly 

315 320 

val Thr ser Gly Gly ser Gly Asn Cys Arg Thr Gly Gly Thr Thr 

330 335 



Tyr Gin Glu val Thr Pro Met val Asn ser Trp Gly val Arg L eu Arg 

345 350 y 



Thr 



<210> 42 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1603 

<400> 42 

gttcatcgat cgcatcggct gccaccggac cactccccca gtc 
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